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Are Minimised In 


Activities Of 






Response To 


Pathways Not 


- Many Proteins In Cells 




Reactive Oxygen 


Maintained Under 


Undergoing Cell Death Or In 






Reactive Oxygen 


Damaged Cells 






Programmed Cell 








Death 





Further, promoters of reactive oxygen responsive genes, as described in the Reference 
tables, for example, are useful to modulate transcription that is induced by reactive oxygen or any of 
the following phenotypes or biological activities below. 
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V.G. SALICYLIC ACID RESPONSIVE GENES, GENE COMPONENTS AND 
PRODUCTS 

Plant defense responses can be divided into two groups: constitutive and induced. 
Salicylic acid (SA) is a signaling molecule necessary for activation of the plant induced defense 
system known as systemic acquired resistance or SAR. This response, which is triggered by prior 
exposure to avirulent pathogens, is long lasting and provides protection against a broad spectrum 
of pathogens. Another induced defense system is the hypersensitive response (HR). HR is far 
more rapid, occurs at the sites of pathogen (avirulent pathogens) entry and precedes SAR. SA is 
also the key signaling molecule for this defense pathway. 

Changes in S A concentration in the surrounding environment or within a plant results in 
modulation of many genes and gene products. Examples of such SA responsive genes and gene 
products are shown in the Reference, Sequence, Protein Group, Protein Group Matrix, MA_diff 
and/or AFLP_diff MA diff and/or AFLP diff and/or AFLP diff and MA_clust tables. These 
genes and/or products are responsible for effects on traits such as plant vigor and seed yield. They 
were discovered and characterized from a much larger set by experiments designed to find genes 
whose mRNA products changed in response to S A treatment. 

While SA responsive polynucleotides and gene products can act alone, combinations of 
these polynucleotides also affect growth and development. Useful combinations include 
different SA responsive polynucleotides and/or gene products that have similar transcription 
profiles or similar biological activities, and members of the same or similar biochemical 
pathways. In addition, the combination of SA responsive polynucleotides and/or gene product 
with another environmentally responsive polynucleotide is also useful because of the interactions 
that exist between hormone regulated pathways, stress and pathogen induced pathways, 
nutritional pathways and development. Here, in addition to polynucleotides having similar 
transcription profiles and/or biological activities, useful combinations include polynucleotides 
that may have different transcription profiles but which participate in common and overlapping 
pathways. 

Such SA responsive genes and gene products can function to either increase or dampen 
the above phenotypes or activities either in response to changes in SA concentration or in the 
absence of SA fluctuations. The MA_diff and/or AFLPdiff MA_diff and/or AFLP_diff and/or 
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AFLPdiff Table(s) reports the transcript levels of the experiment (see EXPT ID: 108586, 
108587, 108515, 108552, 108471, 108472, 108469, 108470, 107953, 107960, 108443, 108440, 
108441, 108475, 108476). For transcripts that had higher levels in the samples than the control, 
a "+" is shown. A "-" is shown for when transcript levels were reduced in root tips as compared 
to the control. For more experimental detail see the Example section below. 
SA genes are those sequences that showed differential expression as compared to controls, 
namely those sequences identified in the MA diff and/or AFLP diff MA diff and/or AFLP_diff 
and/or AFLP diff tables with a "+" or "-" indication. 

SA Genes Identified By Cluster Analyses Of Differential Expression 

SA Genes Identified By Correlation To Genes That Are Differentially Expressed 

As described above, the transcription profiles of genes that act together are well 
correlated. Applicants not only have identified the genes that are differentially expressed in the 
microarray experiments, but also have identified the genes that act in concert with them. The 
MA_clust table indicates groups of genes that have well correlated transcription profiles and 
therefore participate in the same pathway or network. 

A pathway or network of SA genes is any group in the MA clust that comprises a cDNA 
ID that also appears inExpt ID 108586, 108587, 108515, 108552, 108471, 108472, 108469, 
108470, 107953, 107960, 108443, 108440, 108441, 108475, 108476 of the MA_diff and/or 
AFLP^diff MA_diff and/or AFLP diff and/or AFLP diff table(s). 

SA Genes Identified By Correlation To Genes That Cause Physiological Consequences 
Additionally, the differential expression data and the phenotypic observations can be 
merged to identify pathways or networks of SA genes. A group in the MA clust is considered a 
S A pathway or network if the group comprises a cDNA ID that also appears in Knock-in or 
Knock-out tables that causes one or more of the phenotypes described in section above. 

SA Genes Identified By Amino Acid Sequence Similarity 

SA genes from other plant species typically encode polypeptides that share amino acid 
similarity to the sequences encoded by corn and Arabidopsis SA genes. Groups of SA genes are 
identified in the Protein Group table. In this table, any protein group that comprises a peptide ID 
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that corresponds to a cDNA ID member of a SA pathway or network is a group of proteins that 
also exhibits SA functions/utilities. 

Further, promoters of S A responsive genes, as described in the Reference tables, for 
example, are useful to modulate transcription that is induced by SA or any of the following 
phenotypes or biological activities below. 

V.G.I. USE OF SALICYLIC ACID-RESPONSIVE GENES TO MODULATE 
PHENOTYPES 

1 0 SA responsive genes and gene products are useful to or modulate one or more of the 

following phenotypes: 

Pathogen Tolerance and/or Resistance 
Avr/R locus Interactions 

- Non-Host Interactions 

m - hr 

;p - SAR, e.g., SA Responsive Genes And/Or Products In Conjuction With Any Of The 

^ Organisms Listed Below 

m 

- Resistance To Bacteria e.g. to Erwinia stewartii, Pseudomonas syringae, 



m 

m 
°& - 

ru 



Pseudomonas tabaci, Stuart's wilt, etc. 
20 - Resistance To Fungi e.g. to Downy Mildews Such As Scleropthora macrospora, 

Sclerophthora rayissiae, Sclerospora graminicola, Peronosclerospora sorghi, 
Peronosclerospora philippinensis, Peronosclerospora sacchari, Peronosclerospora 
maydis; Rusts Such As Puccinia sorphi, Puccinia polysora, Physopella zeae, etc.; And 
To Other Fungal Diseases e.g. Cercospora zeae-maydis, Colletotrichum graminicola, 
25 Fusarium monoliforme, Exserohilum turcicum, Bipolaris maydis, Phytophthora 

parasitica, Peronospora tabacina, Septoria, etc.; 
- Resistance To Viruses Or Viroids e.g., To Tobacco Or Cucumber Mosaic Virus, 
Ringspot Virus, Necrosis Virus, Pelargonium Leaf Curl Virus, Red Clover Mottle 
Virus, Tomato Bushy Stunt Virus, And Like Viruses; 
30 - Resistance To Insecs, Such As To Aphids e.g. Myzus persicae; to Beetles And Beetle 

Larvae; to lepidoptera larvae e.g. Heliothus etc. 
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Resistance to Nematodes, e.g. Meloidogyne incognita etc 

- Local Resistance In Primary (Infected) Or Secondary (Uninfected) Leaves 
Stress Tolerance 

- Winter Survival 
Cold Tolerance 
Salt Tolerance 

- Heavy Metal Tolerance, Such As Cadmium 

- Tolerance To Physical Wounding; 

- Increased Organelle Tolerance To Redox Stress, Such As In Mitochondria, And 
Chloroplasts 

- Cell Death 

- Programmed Cell Death, Including Death Of Diseased Tissue And During 
Senescence 

Fruit Drop 
Biomass 

- Fresh And Dry Weight During Any Time In Plant Life, Such As Maturation 

- Number Of Flowers, Seeds, Branches, And/Or Leaves 

- Seed Yield, Including Number, Size, Weight, And/Or Harvest Index 

- Fruit Yield, Including Number, Size, Weight, And/Or Harvest Index 
Plant Development 

- Time To Fruit Maturity 

Cell Wall Strengthening And Reinforcement 

- Plant Product Quality 

- Paper Making Quality 
Food Additives 

- Treatment Of Indications Modulated By Free Radicals 
Cancer 

To regulate any of the desired phenotype(s) above, activities of one or more of the SA 
responsive genes or gene products can be modulated and the plants tested by screening for the 
desired trait. Specifically, the gene, mRNA levels, or protein levels can be altered in a plant 
utilizing the procedures described herein and the phenotypes can be assayed. As an example, a 
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plant can be transformed according to Bechtold and Pelletier (1998, Methods. MoL Biol. 
82:259-266) and/or screened for variants as in Winkler et al. (1998) Plant Physiol 118: 743-50 
and visually inspected for the desired phenotype or metabolically and/or functionally assayed 
according to Zhao et al. (1998, Plant Cell 10:359-70) and Alvarez et al. (1998, Cell 92: 733-84). 
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V.G.2. USE OF SALICYLIC ACID-RESPONSIVE GENES TO MODULATE 
BIOCHEMICAL ACTIVITIES 

The activities of one or more of the SA responsive genes can be modulated to 
change biochemical or metabolic activities and/or pathways such as those noted below. 
Such biological activities can be measured according to the citations included in the 
Table below: 



PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATION INCLUDING 
ASSAYS 


Protection From Microbial 
Pathogens 


Systemic Acquired Resistance 
(SAR) 

- Phytoalexin Biosynthesis 

- PR Protein Biosynthesis 
Local Resistance 
Wound Response 


Alvarez et al. (1998) Cell 
92: 733-84 

Lapwood et al. (1984) Plant 
Pathol. 33: 13-20 
Davis etal. (1993) 
Phytochemistry 32: 607-11 
Yahraus et al. (1995) Plant 
Physiol. 109: 1259-66 


Cell Signaling 


- Modulation Of Reactive 
Oxygen Signaling 

- Modulation Of No Signaling 


-Alvarez et al. (1998) Cell 
92: 773-784 
Delledonne et al. (1998) 
Nature 394: 585-588 


Growth And Development 


- Lignification 


Redman et al. (1999) Plant 
Physiol. 119: 795-804 



Other biological activities that can be modulated by the SA responsive genes and gene 
products are listed in the Reference tables. Assays for detecting such biological activities are 
described in the Protein Domain table. 

Salicylic acid responsive genes are characteristically differentially transcribed in response 
to fluctuating SA levels or concentrations, whether internal or external to an organism or cell. 
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The MA_diff and/or AFLP_diff MA_diff and/or AFLP^diff and/or AFLP_diff table reports the 
changes in transcript levels of various SA responsive genes in entire seedlings at 1 and 6 hours 
after the seedling was sprayed with a Hoagland's solution enriched with SA as compared to 
seedlings sprayed with Hoagland's solution only. 

The data from this time course can be used to identify a number of types of SA 
responsive genes and gene products, including "early responders" and "delayed responders 
Profiles of these different SA responsive genes are shown in the Table below together with 
examples of the kinds of associated biological activities. 



GENE 

EXPRESSION 
LEVELS 


FUNCTIONAL 
CATEGORY 
OF GENE 


PHYSIOLOGICAL 
CONSEQUENCES 


EXAMPLES OF 
BIOCHEMICAL 
ACTIVITIES OF GENE 
PRODUCTS 


Upregulated Genes 
(Level At lh=6h) 
Or 

(Level At lh>6h) 


Early 

Responders To 
SA 


- SA Perception 

- SA Uptake 
-Modulation Of SA 
Response Transduction 
Pathways 


-Transcription Factors 
-Transporters, Kinases, 
Phosphatases, G- 
Proteins, LRR, DNA 
Remodelling Proteins 


Upregulated Genes 
(Level At lh<6h) 


Delayed 
Responders To 
SA 


opecinc iyeiensegene 
Transcription Initiation 
(E.G. Pr Genes, Pal 


-Proteases, PRProteins, 
Cellulases, Chitinases, 
Cutinases, Other 
Degrading Enzymes, Pal, 
Proteins Of Defense 
Pathways, Cell Wall 
Proteins 

Epoxide Hydrolases, 
Methyl Transferases 


Downregulated 


- Early 


- Negative Regulation 


Transcription factors, 
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GENE 


FUNCTIONAL 


PHYSIOLOGICAL 


EXAMPLES OF 


EXPRESSION 


CATEGORY 


CONSEQUENCES 


BIOCHFMTCAT 


LEVELS 


OF GENE 




ACTIVTTTFS OF GFNF 

-ti. V — 1 1 V 1 1 11_^lj v_/ 1 V, J 1 jIM ; 








PRODUCTS 


(Level At lh = 6h) 


Responder 


Of SA Inducible 

X ix 111UU^/1 Ly I\^ 


jviiiaacb, piiospnaidses, kj~ 


Or 


Repressors To 


PaLrlW^lV« : ; Released 




(Level At 6h >1h"l 

\ L/v V w I ill v/ 1 1 ^ 1 11 I 


SA 




transporters, calcium 




- vjciicb wren 




binding proteins, 




jL'ibconiinueQ 




chromatin remodelling 




Expression Or 




protein 




T JnsTable 








mRNA Tn The 

111J.V1 lil 111 1 11W 














Down-Resulated 


- DelaveH 


iNcgctuvc iveguiciiion i^i 


Transcription Factors, 


Transcripts 


ResDonders To 


o.ty inuuviuic x^cxiiiWctyo 


jvinases, rnospnatases, 


(Level At lh> 6h) 


SA Metabolism 


O plf»cicf*rl 
xvt^lCciaCU 


o-rroteins, lkk, 




VJCI1C5> Willi 




Transporters, Calcium 




Discontinued 




Binding Proteins, 




Expression Or 




Chromatin Remodelling 




UnsTable 




Protein 




mRNA In The 








Presence Of SA 







Further, any desired sequence can be transcribed in similar temporal, tissue, or 
enviromentally specific patterns as the SA responsive genes when the desired sequence is operably 
linked to a promoter of a SA responsive gene. 

5 
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V.H. NITRIC OXIDE RESPONSIVE GENES, GENE COMPONENTS AND 
PRODUCTS 

The rate-limiting element in plant growth and yield is often its ability to tolerate 
suboptimal or stress conditions, including pathogen attack conditions, wounding and the 
5 presence of various other factors. To combat such conditions, plant cells deploy a battery of 
inducible defense responses, including synergistic interactions between nitric oxide (NO), 
reactive oxygen intermediates (ROS), and salicylic acid (SA). NO has been shown to play a 
critical role in the activation of innate immune and inflammatory responses in animals. At least 
part of this mammalian signaling pathway is present in plants, where NO is known to potentiate 
10 the hypersensitive response (HR). In addition, NO is a stimulator molecule in plant 
*5 photomorphogenesis. 

•13 Changes in nitric oxide concentration in the internal or surrounding environment, or in 

J contact with a plant, results in modulation of many genes and gene products. Examples of such 
J l nitric oxide responsive genes and gene products are shown in the Reference and Sequence 

Tables. These genes and/or products are responsible for effects on traits such as plant vigor and 
|n seed yield. They were discovered and characterized from a much larger set by experiments 
|jB designed to find genes whose mRNA products changed in response to nitric oxide treatment. 

ills! 

:p While nitric oxide responsive polynucleotides and gene products can act alone, 

:|J combinations of these polynucleotides also affect growth and development. Useful combinations 
20 include different nitric oxide responsive polynucleotides and/or gene products that have similar 
transcription profiles or similar biological activities, and members of the same or similar 
biochemical pathways. Whole pathways or segments of pathways are controlled by transcription 
factor proteins and proteins controlling the activity of signal transduction pathways. Therefore, 
manipulation of the levels of such proteins is especially useful for altering phenotypes and 
25 biochemical activities of plants. In addition, the combination of a nitric oxide responsive 

polynucleotide and/or gene product with other environmentally responsive polynucleotides is 
also useful because of the interactions that exist between hormone regulated pathways, stress 
pathways, pathogen stimulated pathways, nutritional pathways and development. Here, in 
addition to polynucleotides having similar transcription profiles and/or biological activities, 
30 useful combinations include polynucleotides that may have different transcription profiles but 

which participate in common or overlapping pathways. The MA diff and/or AFLP_diff MA_diff 
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and/or AFLP_diff and/or AFLPdiff Table(s) reports the transcript levels of the experiment (see 
EXPTID: 108584, 108585, 108526, 108527, 108559). For transcripts that had higher levels in 
the samples than the control, a "+" is shown. A is shown for when transcript levels were 
reduced in root tips as compared to the control. For more experimental detail see the Example 
section below. 

NO genes are those sequences that showed differential expression as compared to 
controls, namely those sequences identified in the MA_diff and/or AFLPdiff MAdiff and/or 
AFLP^diff and/or AFLP_diff tables with a "+" or "-" indication 

NO Genes Identified By Cluster Analyses Of Differential Expression 

NO Genes Identified By Correlation To Genes That Are Differentially Expressed 
As described above, the transcription profiles of genes that act together are well 
correlated. Applicants not only have identified the genes that are differentially expressed in the 
microarray experiments, but also have identified the genes that act in concert with them. The 
MA clust table indicates groups of genes that have well correlated transcription profiles and 
therefore participate in the same pathway or network. 

A pathway or network of NO genes is any group in the MA_clust that comprises a 
cDNA ID that also appears in Expt ID 108584, 108585, 108526, 108527, 108559 of the MA_diff 
and/or AFLP diff MA diff and/or AFLP_diff and/or AFLP_diff table(s). 

NO Genes Identified By Correlation To Genes That Cause Physiological 

Consequences 

Additionally, the differential expression data and the phenotypic observations can 
be merged to identify pathways or networks of NO genes. A group in the MA_clust is 
considered a NO pathway or network if the group comprises a cDNA ID that also appears in 
Knock-in or Knock-out tables that causes one or more of the phenotypes described in section 
above. 

NO Genes Identified By Amino Acid Sequence Similarity 

NO genes from other plant species typically encode polypeptides that share amino 
acid similarity to the sequences encoded by corn and Arabidopsis NO genes. Groups of NO 
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genes are identified in the Protein Group table. In this table, any protein group that comprises a 
peptide ID that corresponds to a cDNA ID member of a NO pathway or network is a group of 
proteins that also exhibits NO functions/utilities. 

Such nitric oxide responsive genes and gene products can function either to increase or 
dampen the above phenotypes or activities either in response to changes in nitric oxide 
concentration or in the absence of nitric oxide fluctuations. Further, promoters of nitric oxide 
responsive genes, as described in the Reference tables, for example, are useful to modulate 
transcription that is induced by nitric oxide or any of the following phenotypes or biological 
activities below. 

V.H.I. USE OF NITRIC OXIDE-RESPONSIVE GENES TO MODULATE 
PHENOTYPES : 

Nitric oxide responsive genes and gene products are useful to or modulate one or more of 
the following phenotypes: 

• Stress Responses 

• Mediation of response to stresses 
Disease resistance 

Growth 
Roots 
Stems 
Leaves 
. Cells 

• Promotes leaf cell elongation 

• Biomass 

• Fresh and Dry Weight during any time in plant life, such as at maturation; 

• Size and/or Weight 

• Flowers 

• Seeds 

• Branches 
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Leaves 



Roots 



5 



Development 
Seed Development 
Dormancy 

Control rate and timing of germination 
Prolongs seed storage and viability 
Senescence 



10 Further, any desired sequence can be transcribed in similar temporal, tissue, or 

O enviromentally specific patterns as the nitric responsive genes when the desired sequence is 

h Q operably linked to a promoter of a nitric responsive gene. 

yS To regulate any of the desired phenotype(s) above, activities of one or more of the nitric 

jjj oxide responsive genes or gene products can be modulated and the plants tested by screening for 
|l the desired trait. Specifically, the gene, mRNA levels, or protein levels can be altered in a plant 
jm utilizing the procedures described herein and the phenotypes can be assayed. As an example, a 
JB plant can be transformed according to Bechtold and Pelletier (1 998) Methods. Mol. Biol. 82: 

259-266 and/or screened for variants as described in Winkler et al. (1 998) Plant Physiol. 118: 

743-50 and visually inspected for the desired phenotype. Alternatively, plants can be 
20 metabolically and/or functionally assayed according to Beligni and Lamattina (2000) Planta 210: 

215-21), Lapwood et al (1984) Plant Pathol 33: 13-20, and/or Brown and Botstein (1999) Nature 

Genet. 21:33-37. 



V.H.2. USE OF NITRIC OXIDE-RESPONSIVE GENES TO MODULATE 
BIOCHEMICAL ACTIVITIES: 

The activities of one or more of the nitric oxide responsive genes can be 
modulated to change biochemical or metabolic activities and/or pathways such as those 
noted below. Such biological activities can be measured according to the citations 
included in the Table below: 
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PROCESS 


BIOCHEMICAL OR 


CITATIONS INCLUDING 




METABOLIC ACTIVITIES 


ASSAYS 




AND/OR PATHWAYS 




Stress Response 


-Programmed Cell Death 


Levineetal(1996) Curr. 






Biol 6: 427-37 






Sellinsand Cohen (1991) 






Radiat. Res. 126: 88-95 




-Reactive Oxygen based Defence 


Kumar and Klessig (2000) 




Pathways 


Mol. Plant Microbe 






Interact. 13:347-351 


Disease Resistance 


-Microbial Pathogen resistance 


Lapwood et al (1984) Plant 




pathways 


Pathol 33: 13-20 






Kumar and Klessig (2000) 






Mol. Plant microbe 






interact. 13: 347-351 






Klessig et.al.(2000) Proc. 






Nat. Acad. Sci USA 97: 






8849-8855 






Delledonnaet al(1998) 






Nature 394: 585-588 




-Programmed Cell Death 


Levine et al (1 996) Curr. 






Biol 6: 427-437 






Sellins and Cohen (1991) 






Radiat. Res. 126: 88-95 




-Cellular Protectant Gene 


Brown and Botstein (1999) 




expression 


Nat Genet 21: 33-37 




- Phytoalexin Biosynthesis 


Davis etal. (1993) 






Phytochemistry 32: 607- 
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PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS INCLUDING 
ASSAYS 






611 



499 



PATENT 

Attorney Docket No. 2750-1481P 



PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS INCLUDING 
ASSAYS 


Signal Transduction 


Regulation of hydrogen peroxide 


Wu et al. (1995) Plant Cell 




signaling 


7, 1357-1368 


Reorientation of nitrogen 


Induction of ribosomal proteins, 


This study. Standard 


metabolism 


asparagine synthesis, proteases, 


assays for detection of 




Rnases 


changes 


Reorientation of sugar and 


Induction of sugar transporters, 


This study. Standard 


energy metabolism 


ATPases, glycohydrolases, and 


assays for detection of 




glycolytic enzymes, for example 


changes 



Other biological activities that can be modulated by the NO responsive genes and gene 
products are listed in the Reference Tables. Assays for detecting such biological activities are 
described in the Protein Domain table. 

NO responsive genes are characteristically differentially transcribed in response to 
fluctuating NO levels or concentrations, whether internal or external to an organism or cell. The 
MA_diff and/or AFLP_diff MA_diff and/or AFLP diff and/or AFLP_diff table(s) report(s)the 
changes in transcript levels of various NO responsive genes in aerial tissues at 1 and 6 hours 
after a plant was sprayed with a Silwett L-77 solution enriched with 5 mM sodium nitroprusside, 
which is an NO donor. These changes are in comparison with plants sprayed with Silwett L-77 v 
solution only. 

The data from this time course can be used to identify a number of types of NO 
responsive genes and gene products, including "early responders" and "delayed responders" 
Profiles of these different nitric oxide responsive genes are shown in the Table below together 
with examples of the kinds of associated biological activities. 
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GENE 


FUNCTIONAL 


PHYSIOLOGICAL 


EXAMPLES OF 


EXPRESSION 


CATEGORY OF 


CONSEQUENCES 


BIOCHEMICAL 


LEVEL 


GENE 




ACTIVITY 


Upregulated genes 


Early responder 


- NO Perception 


-Transcription Factors 


(level at 1 hour = 6 


repressors to NO 


- NO Uptake 




hours) 




- Modulation of NO 


-Transporters 


(level at 1 hour > 6 




Response Transduction 


-Pathogen responsive 


hours) 




Pathways 


proteins, salicylic and 








jasmonate pathway 








proteins 






Specific Gene 


-Proteins to provide 






Transcription Initiation 


defence against active 






of Pathways to 


oxygen e.g. glutathione 






Optimize NO Response 


transferase,ascorbate 






Pathways 


free radical reductase, 








ascorbate peroxidase, 








nitrilase, heat shock 








proteins 








-Proteins to reorient 








metabolism 








e.g.proteases, Rnases, 








proteasomes, 








asparagine synthetase, 








glycohydrolases, 








transporters 








-Proteins to inhibit 








transport of nitric oxide 








-Degradation enzymes 
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GENE 


FUNCTIONAL 


PHYSIOLOGICAL 


EXAMPLES OF 


EXPRESSION 


CATEGORY OF 


CONSEQUENCES 


BIOCHEMICAL 


LEVEL 


GENE 




ACTIVITY 


Upregulated 


Delayed NO 


- Maintenance of 


- NO Metabolic 


transcripts 


responders 


metabolism in presence 


Pathway enzymes 


(level at 1 hour < 6 




of High NO 


-Pathogen responsive 


hours) 




-Maintenace of disease 


proteins, salicylic and 






defence pathways 


jasmonate pathway 








proteins 






-Maintenance of 


-Proteins to provide 






pathways against 


defence against active 






reactive oxygen 


oxygen e.g. glutathione 






production 


transferase ? ascorbate 








free radical reductase, 








ascorbate peroxidase, 








nitrilase, heat shock 








proteins 






Maintenance of 


-Proteins to reorient 






different metabolic 


and sustain metabolism 






programs 


e.g.proteases,Rnases,pr 








oteasomes,asparagine 








synthetase,glycohydrol 








ases,transporters, 








-Proteins to inhibit 








transport of NO 






Selective cell death 


-Degradation enzymes 


Down Regulated 


-Early responders of 


Negative regulation of 


-Transcription factors 


Transcripts 


NO utilization 


NO utilization 


-Kinases and 


(level at 1 hours = 6 


pathways 


pathways released 


phosphatases 
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GENE 


FUNCTIONAL 


PHYSIOLOGICAL 


EXAMPLES OF 


EXPRESSION 


CATEGORY OF 


CONSEQUENCES 


BIOCHEMICAL 


LEVEL 


GENE 




ACTIVITY 


hours) 






-Chromatin 


(level at 6 hours > 1 






restructuring proteins 


hour) 


-Genes with 


Reorientation of 


- Transcription 




discontinued 


metabolism 


factors 5 metabolic 




expression or 




enzymes, kinases and 




unsTable mRNA 




phosphatases, 




following nitric oxide 




transporters, ribosomal 




uptake 




proteins 






Programmed cell death 


-Most proteins in cells 








undergoing cell death 


Down Regulated 


-Delayed responder 


Negative regulation of 


Transcription factors 


Transcripts 


repressors of NO 


NO utilization 


-Kinases and 


(level at 1 hour > 6 


stress metabolism 


pathways released 


phosphatases 


hours) 






-Chromatin 








restructuring proteins 




-Genes with 


Reorientation of 


-Transcription 




discontinued 


metabolism 


factors,metabolic 




expression or 




enzymes, kinases and 




unsTable 




phosphatases, 




mRNA following 




transporters, ribosomal 




nitric oxide uptake 




proteins. 






Programmed cell death 


-Most proteins in cells 








undergoing 








programmed cell death 
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Use of Promoters of NO Responsive Genes 

Promoters of NO responsive genes are useful for transcription of any desired 
polynucleotide or plant or non-plant origin. Further, any desired sequence can be transcribed in 
5 a similar temporal, tissue, or environmentally specific patterns as the NO responsive genes where 
the desired sequence is operably linked to a promoter of a NO responsive gene. The protein 
product of such a polynucleotide is usually synthesized in the same cells, in response to the same 
stimuli as the protein product of the gene from which the promoter was derived. Such promoter 
are also useful to produce antisense mRNAs to down-regulate the product of proteins, or to 
10 produce sense mRNAs to down-regulate mRNAs via sense suppression. 




m 
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V.I. OSMOTIC STRESS RESPONSIVE GENES, GENE COMPONENTS AND 
PRODUCTS 

The ability to endure and recover from osmotic and salt related stress is a major 
determinant of the geographical distribution and productivity of agricultural crops. Osmotic 
stress is a major component of stress imposed by saline soil and water deficit. Decreases in yield 
and crop failure frequently occur as a result of aberrant or transient environmental stress 
conditions even in areas considered suitable for the cultivation of a given species or cultivar. 
Only modest increases in the osmotic and salt tolerance of a crop species would have a dramatic 
impact on agricultural productivity. The development of genotypes with increased osmotic 
tolerance would provide a more reliable means to minimize crop losses and diminish the use of 
energy-costly practices to modify the soil environment. 

Changes in the osmotic concentration of the surrounding environment or within a plant 
results in modulation of many genes and gene products. Examples of such osmotic stress 
responsive genes and gene products, including salt responsive genes, are shown in the Reference, 
Sequence, Protein Group, Protein Group Matrix, MAdiff and/or AFLP_diff MA_diff and/or 
AFLP diff and/or AFLP diff and MA clust tables. These genes and/or products are 
responsible for effects on traits such as plant vigor and seed yield. 

While osmotic and/or salt stress responsive polynucleotides and gene products can act 
alone, combinations of these polynucleotides also affect growth and development. Useful 
combinations include different osmotic stress responsive polynucleotides and/or gene products 
that have similar transcription profiles or similar biological activities, and members of the same 
or similar biochemical pathways. In addition, the combination of an osmotic stress responsive 
polynucleotide and/or gene product with another environmentally responsive polynucleotide is 
also useful because of the interactions that exist between hormone regulated pathways, stress 
pathways, nutritional pathways and development. Here, in addition to polynucleotides having 
similar transcription profiles and/or biological activities, useful combinations include 
polynucleotides that may have different transcription profiles but which participate in a common 
pathway. 

Such osmotic and/or salt stress responsive genes and gene products can function to either 
increase or dampen the above phenotypes or activities either in response to changes in osmotic 
concentration or in the absence of osmotic fluctuations. The MA diff and/or AFLP_diff 
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MA_diff and/or AFLP diff and/or AFLP diff Table(s) reports the transcript levels of the 
experiment (see EXPT ID: 108570, 108571, 108541, 108542, 108553, 108539, 108540). For 
transcripts that had higher levels in the samples than the control, a "+" is shown. A "-" is shown 
for when transcript levels were reduced in root tips as compared to the control. For more 
experimental detail see the Example section below. 

Osmotic Stress genes are those sequences that showed differential expression as 
compared to controls, namely those sequences identified in the MA_diff and/or AFLP_diff 
MA_diff and/or AFLP diff and/or AFLP diff tables with a "+" or indication 

Osmotic Stress Genes Identified By Cluster Analyses Of Differential Expression 

Osmotic Stress Genes Identified By Correlation To Genes That Are Differentially 
Expressed 

As described above, the transcription profiles of genes that act together are well 
correlated. Applicants not only have identified the genes that are differentially expressed in the 
microarray experiments, but also have identified the genes that act in concert with them. The 
MA_clust table indicates groups of genes that have well correlated transcription profiles and 
therefore participate in the same pathway or network. 

A pathway or network of Osmotic Stress genes is any group in the MA clust that 
comprises a cDNA ID that also appears in Expt ID 108570, 108571, 108541, 108542, 108553, 
108539, 108540 of the MA_diff and/or AFLP_diff MA_diff and/or AFLP_diff and/or 
AFLP_diff table(s). 

Osmotic Stress Genes Identified By Correlation To Genes That Cause 
Physiological Consequences 

Additionally, the differential expression data and the phenotypic observations can be 
merged to identify pathways or networks of Osmotic Stress genes. A group in the MA_clust is 
considered a Osmotic Stress pathway or network if the group comprises a cDNA ID that also 
appears in Knock-in or Knock-out tables that causes one or more of the phenotypes described in 
section above. 

Osmotic Stress Genes Identified By Amino Acid Sequence Similarity 
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Osmotic Stress genes from other plant species typically encode polypeptides that share 
amino acid similarity to the sequences encoded by corn and Arabidopsis Osmotic Stress genes. 
Groups of Osmotic Stress genes are identified in the Protein Group table. In this table, any 
protein group that comprises a peptide ID that corresponds to a cDNA ID member of a Osmotic 
5 Stress pathway or network is a group of proteins that also exhibits Osmotic Stress 
functions/utilities. 



Further, promoters of osmotic stress responsive genes, as described in the Reference tables, 
for example, are useful to modulate transcription that is induced by osmotic stress or any of the 
1 0 following phenotypes or biological activities below. 

■.ft V.I.I. USE OF OSMOTIC STRESS RESPONSIVE GENES TO MODULATE 

— . : 

JJj PHENOTYPES 

^] Osmotic stress responsive genes and gene products are useful to or modulate one or more 

|j1 5 of the following phenotypes : 



Growth 



- Roots 

iiij 

- Stems 

- Leaves 

20 - Development 

- Cell Growth 

- DNA Synthesis and Cell Division 
Seed Development 

Desiccation tolerance 
25 - Dormancy 

Control rate of Germination 
Prolongs Seed Storage and Viability 
Senescence 
Stress Responses 
30 - Desiccation 

Drought 
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Salt 



To regulate any of the phenotype(s) above, activities of one or more of the osmotic stress 
responsive genes or gene products can be modulated and the plants tested by screening for the 
5 desired trait. Specifically, the gene, mRNA levels, or protein levels can be altered in a plant 
utilizing the procedures described herein and the phenotypes can be assayed. As an example, a 
plant can be transformed according to Bechtold and Pelletier (1998, Methods. Mol. Biol. 
82:259-266) and/or screened for variants as in Winkler et al. (1998) Plant Physiol 118: 743-50 
and visually inspected for the desired phenotype or metabolically and/or functionally assayed 
10 according to de Castro (1998, Phytochemistry 47: 689-694), Xu (1998, J Exp Bot 49: 573-582), 

Ausubel et al. (In: Current Protocols in Molecular Biology (1999) Volume 1, chapter 4, eds. 
Q Ausubel, Brent, Kingston, Moore, Seidman, Smith and Struhl, New York, NY) and De Castro et 
al. (2000, Plant Physiol 122: 327-36) 



m 



V.I.2. USE OF OSMOTIC STRESS RESPONSIVE GENES TO MODULATE 
BIOCHEMICAL ACTIVITIES 



The activities of one or more of the osmotic stress responsive genes can be 



TO 

4« modulated to change biochemical or metabolic activities and/or pathways such as those 



noted below. Such biological activities can be measured according to the citations 
20 included in the Table below: 



PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS 

INCLUDING 

ASSAYS 


Cell Growth And 
Differentiation 


Regulation Of Osmolyte 
Synthesis 


Yoshuetal. (1995) 
The Plant Journal 
7:751-60 




Regulation Of Glycolate Pathway 
And Photoinhibition Of 
Photosystem II In Response To 
Stress 


Strebetal. (1993) 
Physiologia 
Plantarum. 88:590- 
598 
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PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS 

INCLUDING 

ASSAYS 


Gene Regulation 


Transcriptional Regulation Of 
Osmotic Stress Induced Proteins 
Through DNA Binding Proteins 


Current Protocols in 
Molecular Biology / edited 
by Frederick M. Ausubel .. 
[et al.]. New York : 
Published by Greene Pub. 
Associates and Wiley- 
Interscience : J. Wiley, 
cl987 




Transcriptional Regulation Of 
Osmotic Stress Induced Proteins 
Through Protein Phosphorylation 
And Dephosphorylation 


Jonak (1996) Proceedings 
of the National Academy of 
Sciences of the United 
States of America, 93: 
11274-11279; 

Monroy, A.et al., 

(1998) 
Analytical Biochemistry 
265: 183-185; 




Regulation Of Osmotic Stress 
Induced Gene Protein 
Accumulation By Protein Protein 
Intereaction Between Osmotic 
Stress Regulated Genes And 
Protein Phosphatase 2C 


McCright(1998) IN: 
Methods in Molecular 
Biology; Protein 
phosphatase protocols; 
Ludlow (1998) Humana 
Press Inc. ; Suite 808, 999 
Riverview Drive, Totowa, 
New Jersey 

07512, USA. :263-277. 




Transcriptional Regulation Of 
Heat Induced Genes Through 


Luo and Dean (1999) 
Journal of the 
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PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS 

INCLUDING 

ASSAYS 




Chromatin Remodeling 


National Cancer 
Institute 91: 1288- 
1294; 

Chromatin 
protocols (1999) 
edited by Peter B. 
Becker. Totowa, 
N.J. : Humana 
Press 




Activity Of Abcisic Acid 
Regulated DNA Binding Proteins 


Gubler et al. (1999) 
Plant Journal 17: 1- 
9 




Accumulation Of RNA Binding 
Proteins That Regulate Osmotic 
Stress 


Sato (1995) 
Nucleic Acids Research 23: 
2161-2167. 


Stress Response 


Synthesis And Metabolism Of 
Osmoprotectants Such As 
Betaine, Proline And Trehalase 


Minocha et al. 
(1999) Plant 
Physiol and 
Biochem 37: 597- 
603 




Regulation Of Sugar Transporters 


Dejardin et al. 
(1999) Biochem J; 
344 Pt 2:503-9 




Regulation Of Vacuolar 
Sodium/Proton Antiport Activity 
And The Detoxification Of 
Cations 


Gaxiola et al. 
(1999) PNAS USA 
96: 1480-1485 




Regulation Of Intracellular Na+ 


Espinoza-Ruiz et 
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PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS 

INCLUDING 

ASSAYS 




And Li+ Ion Concentrations 


al. (1999) The Plant 
Journal 20: 529-539 




Regulation Of Universal Stress 
Protein Homologue Activity By 
Phosphorylation And 
Dephosphorylation. 


Freestone et al. 
(1997) Journal of 
Molecular Biology, 
v. 274:318-324 




Regulation/Maintenance Of 
Protein Stability During Thermal 
Stress 


Walker (1996) 
Humana Press Inc. 
Suite 808, 999 
Riverview Drive, 
Totowa, New 
Jersey 07512, USA 




Regulation Of Protein 
Degradation During Thermal 
Stress. 


Vierstra(1996) Plant 
Molecular Biology,32:275- 
302. 

Vierstra and Callis 
(1999) Plant 
Molecular Biology, 
41:435-442 


Signal Transduction 


Activation Of Stress Response 
Genes 


Xinong et al. 
(1999) The Plant 
Journal 19: 569-578 




Salt Tolerance 


Piao(1999) Plant 
Physiol 19: 1527- 
1534 




Calcium Mediated Stress 
Response 


Subbaiah et al. 
(1994) Plant 
Physiology 
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PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS 

INCLUDING 

ASSAYS 






105:369-376 
Kudla et al.(1999) 
PNAS USA 96: 
4718-4723 



Other biological activities that can be modulated by the osmotic stress responsive genes 
and gene products are listed in the Reference tables. Assays for detecting such biological 
activities are described in the Protein Domain table. 

Osmotic stress responsive genes are characteristically differentially transcribed in 
response to fluctuating osmotic stress levels or concentrations, whether internal or external to an 
organism or cell. MA__diff and/or AFLP_diff MA_diff and/or AFLP_diff and/or AFLP_diff 
table reports the changes in transcript levels of various osmotic stress responsive genes in aerial 
tissues of plants at 1 and 6 hours after the plants were sprayed with Hoagland's solution 
containing 20% PEG as compared to aerial tissues from plants sprayed with Hoagland's solution 
only. 

The data from this time course can be used to identify a number of types of 
osmotic stress responsive genes and gene products, including "early responding," "sustained 
osmotic stress responders," "repressors of osmotic stress pathways" and "osmotic stress 
responders." Profiles of these different osmotic stress responsive genes are shown in the Table 
below together with examples of the kinds of associated biological activities. 



GENE 

EXPRESSION 
LEVELS 


FUNCTIONAL 
CATEGORY OF 
GENES 


PHYSIOLOGICAL 
CONSEQUENCES 


EXAMPLES OF 
BIOCHEMICAL 
ACTIVITIES OF 
GENE PRODUCTS 


Up Regulated 
Transcripts 
(Level At 1 Hour = 
6 Hours) 


Early Responders 
To Osmotic 
Stress 


Osmotic Stress 
Perception 

• Osmolyte Uptake 

• Modulation Of 


• Transcription 
Factors 
Transcription 
Coactivators 
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(Level At 1 Hour > 


Universal Stress 


Osmotic Stress 


• Membrane 


6 Hours) 


Response Genes 


Response Signal 


Transporters 






Transduction Pathways 


Proline 




♦ Osmotic Stress 


Specific Gene 


Biosynthesis 




Responders 


Transcription 


• Selective Inhibition 






Initiation 


Of Osmolyte 




♦ Abscisic Acid 


• Specific Gene 


Transport 




Biosynthesis And 


Transcription 


Protein 




Perception 


Repression 


Ubiquitination 






• Translation Activation 


Protein 






Translation 


Degradation 






Repression 


Rna Binding 






Repression Of 


Proteins 






"Normal State" 


Modification Of 






Pathways To Optimize 


Protein Activity By 






Osmotic Stress 


Phosphatases, 






Response 


Kinases 






• Activation Of Stress 


• Synthesis And Or 






Signaling Pathways 


Activation Of 






• Up Regulation Of 


Oxide Hydrolases, 






Abscisic Acid 


Suoeroxidedismuta 






Biosynthesis Pathway 


se, Iron Ascorbate 






Protein Accumulation 


Peroxidase 






And Activity 


• Activation Of 






Scavenging Reactive 


Signaling Pathway 






Oxygen Species 


By Calcium 






Modification Of Cell 


Binding Proteins, 






Wall Composition 


• Modification Of 






Up-Regulation Of 


Protein Activity By 






Universal Stress 


Protein-Protein 






Response Protein 


Interaction 
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Accumulation 


Change In 








Chromatin 








Structure And/Or 








Localized Dna 








Topology 








• Modification Of 








Pre-Existing 








Translation Factors 








By Phosphorylation 








(Kinases) Or 








Dephosphorylation 








(Pho sphatases) 








Synthesis Of New 








Translation Factors 








• Abscisic Acid 








Biosynthesis 


Up Regulated 


Sustained 


Osmolyte Adjustment 


Osmotic Stress 


Transcripts 


Osmotic Stress 


And Adaptation 


Metabolic 


(Level At 1 Hr < 6 


Responders 


• Photosynthetic 


Pathways 


Hr) 




Activity Modification 


Sugar Biosynthetic 




• Repressor Of 


Activation Of 


Pathways 




Osmotic Stress 


"Normal State" 


• Sugar Transporters 




Pathways 


Biosynthesis Genes 


• Transcription 






Negative Regulation 


Factors 




• Abscisic Acid 


Of Osmotic Stress 


• Transcription 




Perception, 


Pathways 


Coactivators 




Biosynthesis And 


Negative Regulation 


• Membrane 




Regulation 


Of Abscisic Acid 


Transporters 






Biosynthesis 


Abscisic Acid 






Acivation Of Abscisic 


Biosynthesis 






Acid Degradation 
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Pathway 
• Cell Wall 
Composition 
Modification 




Down-Regulated 


Early Responder 


Metabolic Repression 


Transcription 


Transcripts 


Repressors Of 


Specific Gene 


Factors 


(Level At 1 Hr « 6 


"Normal" State Of 


Transcription 


• Transcription 


Hr) 


Metabolism 


Initiation 


Coactivators 


(Level At 6 Hr > 1 




Specific Gene 


♦ Protein 


Hr) 


Negative Regulators 


Transcription 


Degradation 




Of Abscisic Acid 


Repression 


• Rna Binding 




Biosynthesis And 


• Translation Activation 


Proteins 




Perception. 


Translation 


Modification Of 






Repression 


Protein Activity By 




Positive Regulators 


Abscisic Acid 


Phosphatases, 




Of "Normal State" 


Degradation 


Kinases 




Metabolic Pathways. 


• Protein Degradation 


Activation Of 








Signaling Pathway 








By Calcium 








Binding Proteins, 








Modification Of 








Protein Activity By 








Protein-Protein 








Interaction 








Change In 








Chromatin 








Structure And/Or 








Localized Dna 








Topology 








• Modification Of 








Pre-Existing 



515 



PATENT 

Attorney Docket No. 2750-1481P 









Translation Factors 
By Phosphorylation 
(Kinases) Or 
Dephosphorylation 
(Phosphatases) 
Synthesis Of New 
Translation Factors 


Down-Regulated 


Repressors Of 


• Osmotic Stress 


• Transcription 


Transcripts 


"Normal" State Of 


Adaptation 


Factors 


(Level At 1 Hr>6 


Metabolism 


Negative Regulation 


• Transcription 


Hr) 




Of Abscisic Acid 


Coactivators 




Genes With 


Biosynthesis 


• Protein 




Discontinued 


Negative Regulation 


Degradation 




Expression Or 


Of Osmotic Stress 


Rna Binding 




UnsTable mRNA In 


Response Pathways 


Proteins 




Presence Of Osmotic 


Genes 


Modification Of 




Stress 


Osmolyte Synthesis 


Protein Activity By 






And Osmolyte 


Phosphatases, 




Repressor Of 


Cellular Partitioning 


Kinases 




Osmotic Stress 


Readjustment 


Activation Of 




Pathways 


• Activation Of 


Signaling Pathway 






"Normal State" 


By Calcium 




Repressors Of 


Metabolic Pathways 


Binding Proteins, 




Abscisic Acid 




• Modification Of 




Biosynthesis, 




Protein Activity By 




Perception And 




Protein-Protein 




Regulation 




Interaction 








Change In 








Chromatin 








Structure And/Or 








Localized Dna 
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Topology 
Modification Of 
Pre-Existing 
Translation Factors 
By Phosphorylation 
(Kinases) Or 
Dephosphorylation 
(Phosphatases) 

• Synthesis Of New 
° Translation Factors 

• Sugar Biosynthetic 
Pathways 

Sugar Transporters 



Further, any desired sequence can be transcribed in similar temporal, tissue, or 
enviromentally specific patterns as the osmotic stress responsive genes when the desired sequence is 
operably linked to a promoter of an osmotic stress responsive gene. 
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V.J. ALUMINUM RESPONSIVE GENES, GENE COMPONENTS AND 
PRODUCTS 

Aluminum is toxic to plants in soluble form (Al 3+ ). Plants grown under aluminum stress 
have inhibited root growth and function due to reduced cell elongation, inhibited cell division 
5 and metabolic interference. As an example, protein inactivation frequently results from 

displacement of the Mg2+ cofactor with aluminum. These types of consequences result in poor 
nutrient and water uptake. In addition, because stress perception and response occur in the root 
apex, aluminum exposure leads to the release of organic acids, such as citrate, from the root as 
the plant attempts to prevent aluminum uptake. 
10 The ability to endure soluble aluminum is a major determinant of the geographical 

^ distribution and productivity of agricultural crops. Decreases in yield and crop failure frequently 
occur as a result of aberrant, hot conditions even in areas considered suiTable for the cultivation 



yi of a given species or cultivar. Only modest increases in the aluminum tolerance of crop species 

m 

I would have a dramatic impact on agricultural productivity. The development of genotypes with 
4 'ft increased aluminum tolerance would provide a more reliable means to minimize crop losses and 

Ip diminish the use of costly practices to modify the environment. 

CO 

Microarray technology allows monitoring of gene expression levels for thousands of 
genes in a single experiment. This is achieved by simultaneously hybridizing two differentially 
labeled fluorescent cDNA pools to glass slides that contain spots of DNA (Schena et al. (1995) 
20 Science 270 : 467-70). The Arabidopsis Functional Genomics Consortium (AFGC) has recently 
made public the results from such microarray experiments conducted with AFGC chips 
containing 10,000 non-redundant ESTs, selected from 37,000 randomly sequenced ESTs 
generated from mRNA of different tissues and developmental stages. 

The sequences of the ESTs showing at least two-fold increases or decreases over the 
25 controls were identified, compared to the Ceres full-length cDNA and genomic sequence 

databanks, and identical Ceres clones identified. MA_diff and/or AFLP_diff MA diff and/or 
AFLPdiff and/or AFLPdiff table reports the results of this analysis, indicating those Ceres 
clones which are up or down regulated over controls, thereby indicating the Ceres clones which 
are aluminum response responsive genes. 
30 The MA__diff and/or AFLP_diff MA_diff and/or AFLP_diff and/or AFLP_diff Table(s) 

reports the transcript levels of the experiment (see EXPT ID: Aluminum (relating to SMD 7304, 
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SMD 7305)). For transcripts that had higher levels in the samples than the control, a "+" is 
shown. A "-" is shown for when transcript levels were reduced in root tips as compared to the 
control. For more experimental detail see the Example section below. 

Aluminum genes are those sequences that showed differential expression as compared to 
controls, namely those sequences identified in the MAdiff and/or AFLP diff MA diff and/or 
AFLP_diff and/or AFLPdiff tables with a "+" or "-" indication 

Aluminum Genes Identified By Cluster Analyses Of Differential Expression 

Aluminum Genes Identified By Correlation To Genes That Are Differentially 

Expressed 

As described above, the transcription profiles of genes that act together are well 
correlated. Applicants not only have identified the genes that are differentially expressed in the 
microarray experiments, but also have identified the genes that act in concert with them. The 
MA_clust table indicates groups of genes that have well correlated transcription profiles and 
therefore participate in the same pathway or network. 

A pathway or network of Aluminum genes is any group in the MA_clust that 
comprises a cDNA ID that also appears in Expt ID Aluminum (relating to SMD 7304, SMD 
7305) of the MA^diff and/or AFLP_diff MA_diff and/or AFLP diff and/or AFLP diff table(s). 

Aluminum Genes Identified By Correlation To Genes That Cause Physiological 

Consequences 

Additionally, the differential expression data and the phenotypic observations can 
be merged to identify pathways or networks of Aluminum genes. A group in the MA_clust is 
considered a Aluminum pathway or network if the group comprises a cDNA ID that also appears 
in Knock-in or Knock-out tables that causes one or more of the phenotypes described in section 
above. 

Aluminum Genes Identified By Amino Acid Sequence Similarity 

Aluminum genes from other plant species typically encode polypeptides that 
share amino acid similarity to the sequences encoded by corn and Arabidopsis Aluminum genes. 
Groups of Aluminum genes are identified in the Protein Group table. In this table, any protein 
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group that comprises a peptide ID that corresponds to a cDNA ID member of a Aluminum 
pathway or network is a group of proteins that also exhibits Aluminum functions/utilities. 



5 V.J. 1 . USE OF ALUMINUM RESPONSE GENES TO MODULATE 

PHENOTYPES 

Changes in aluminum concentrations in a plant's surrounding environment results 
in modulation of many genes and gene products. Examples of such aluminum response 
genes and gene products are shown in the Reference and Sequence Tables. These genes 
10 and/or products are responsible for effects on traits such as plant vigor and seed yield. 

While aluminum responsive polynucleotides and gene products can act alone, 
combinations of these polynucleotides also affect growth and development. Useful 

[ iH combinations include different aluminum responsive polynucleotides and/or gene 

;{3l 

j?y products that have similar transcription profiles or similar biological activities, and 

members of the same or similar biochemical pathways. In addition, the combination of a 
G3 aluminum responsive polynucleotide and/or gene product with another environmentally 

f{j responsive polynucleotide is also useful because of the interactions that exist between 

J hormone regulated pathways, stress pathways, nutritional pathways and development. 

!h* Here, in addition to polynucleotides having similar transcription profiles and/or 

20 biological activities, useful combinations include polynucleotides that may have different 

transcription profiles but which participate in a common pathway. 

Such aluminum responsive genes and gene products can function to either increase 
or dampen the above phenotypes or activities either 
• in response to changes in aluminum concentration or 
25 • in the absence of aluminum fluctuations. 

More specifically, aluminum responsive genes and gene products are useful to or 
modulate one or more of the following phenotypes: 
Growth 
Roots 

30 Inhibition of root elongation 

Stems 
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Leaves 
- Whole Plant 
Development 

- Cell Growth 

Elongation 
Division 

- Mediates response to oxidative stress, calcium-mediated defense, antioxidant 
defense and pathogenesis 

To produce the desired phenotype(s) above, one or more of the aluminum 
response genes or gene products can be tested by screening for the desired trait. 
Specifically, the gene, mRNA levels, or protein levels can be altered in a plant utilizing 
the procedures described herein and the phenotypes can be assayed. As an example, a 
plant can be transformed according to Bechtold and Pelletier (1998, Methods. Mol. Biol. 
82:259-266) and visually inspected for the desired phenotype or metabolically and/or 
functionally assayed according to Li and Fleming (1999, FEBS Lett 461 : 1-5), Delhaize 
et al. (1999, J Biol Chem 274: 7082-8), Sigimoto and Sakamoto (1997, Genes Genet Syst 
72: 311-6), Esaki et al. (2000, Plant Physiol 122: 657-65), Leonard and Gerber (1988, 
Mutat Res 196: 247-57), Baisakhi et al. (2000, Mutat Res 465: 1-9), Ma (2000, Plant Cell 
Physiol 41: 383-90) and Koyama et al. (1999, Plant Cell 40: 482-8) 

Alternatively, the activities of one or more of the aluminum responsive genes can be 
modulated to change biochemical or metabolic activities and/or pathways such as those noted 
below. Such biological activities can be measured according to the citations included in the 
Table below: 



GENERAL 
CATEGORY 


BIOCHEMICAL OR 
METABOLIC 
ACTIVITIES AND/OR 
PATHWAYS 


ASSAY 


Cell Growth and 
Development 


-Phospholipase D (PLD) 
activity 


Todaetal. (1999) 
Biosci Biotechnol 
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-Regulation of 
Phosphtidylserine 
Synthase (PSS) 


Biochem 63: 210- 
212 




-Cell wall strengthening 


Hameletal. (1998) 
Planta 205: 531-38 


Stress Response 


-Regulation of oxidative 
stress 


Esaki et al. (2000) 
Plant Physiol 122: 
657-655 




-Regulation of 
antioxidant defense and 
DNA repair 


Baisakhi et al. 
(2000) Mutat Res 
465: 1-9 




-Secretion of Organic 
Acids (e.g. maleate, 
citrate) from root apex 


Koyama et al. 
(1999) Plant Cell 
40: 482-8 




-Ca2+mediated Defense 
Responses Against Low 
pH 


Pliethetal. (1999) 
Plant J 18: 634-50 


Signaling 


-H+ transport 
-Auxin transport 


Degenhardt et al. 
(1988) Plant Physil 
117: 19-27 
Rashotte et al. 
(2000) Plant 
Physiol 122:481- 
90 
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Other biological activities that can be modulated by aluminum response genes and their 
products are listed in the REFERENCE Table. Assays for detecting such biological activities are 
described in the Protein Domain table. 
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TRANSCRIPT 


TYPE OF GENES 


PHYSIOLOGICAL 


EXAMPLES OF 


LEVELS 




CONSEQUENCES 


BIOCHEMICAL 








ACTIVITY 


Up regulated 


responders to 


• Aluminum 


• Transporters 


transcripts 


aluminum 


perception 


• Metabolic enzymes 




application 


Aluminum uptake 


• Change in cell 






and transport 


membrane structure 






Aluminum 


and potential 






metabolism 


• Kinases and 






♦ Synthesis of 


phosphatases 






secondary 


• Transcription 






metabolites and/or 


activators 






proteins 


Change in chromatin 








structure and/or 






Modulation of 


localized DNA 






aluminum 


topology 






response 


• 






transduction 








pathways 








Specific gene 








transcription 








initiation 




Down-regulated 


responder to 


• Negative 


• Transcription factors 


transcripts 


aluminum 


regulation of 


Change in protein 




repressors of 


aluminum 


structure by 




aluminum state of 


pathways 


phosphorylation 




metabolism 




(kinases) or 






• Changes in 


dephosphorylation 




Genes with 


pathways and 


(phosphatases) 



524 



PATENT 

Attorney Docket No. 2750-1481P 



TRANSCRIPT 
LEVELS 


TYPE OF GENES 


PHYSIOLOGICAL 
CONSEQUENCES 


EXAMPLES OF 
BIOCHEMICAL 
ACTIVITY 




discontinued 
expression or 
unsTable mRNA in 
presence of aluminum 


processes 
operating in cells 
• Changes in other 
metabolisms than 
aluminum 


Change in chromatin 
structure and/or DNA 
topology 

Stability of factors for 
protein synthesis and 
degradation 
♦ Metabolic enzymes 



w 

ii Pi 

Use of Promoters of Aluminum Responsive Genes 
flJ Promoters of Aluminum responsive genes are useful for transcription of any desired 

m 

polynucleotide or plant or non-plant origin. Further, any desired sequence can be transcribed in 
,:Jf5 a similar temporal, tissue, or environmentally specific patterns as the Aluminum responsive 
ijy genes where the desired sequence is operably linked to a promoter of a Aluminum responsive 
iik gene. The protein product of such a polynucleotide is usually synthesized in the same cells, in 
^ response to the same stimuli as the protein product of the gene from which the promoter was 

derived. Such promoter are also useful to produce antisense mRNAs to down-regulate the 
1 0 product of proteins, or to produce sense mRNAs to down-regulate mRNAs via sense 

suppression. 
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V.K. CADMIUM RESPONSIVE GENES, GENE COMPONENTS AND PRODUCTS 
Cadmium (Cd) has both toxic and non-toxic effects on plants. Plants exposed to non- 
toxic concentrations of cadmium are blocked for viral disease due to the inhibition of systemic 
movement of the virus. Surprisingly, higher, toxic levels of Cd do not inhibit viral systemic 
movement, suggesting that cellular factors that interfere with the viral movement are triggered by 
non-toxic Cd concentrations but repressed in high Cd concentrations. Furthermore, exposure to 
non-toxic Cd levels appears to reverse posttranslational gene silencing, an inherent plant defense 
mechanism. Consequently, exploring the effects of Cd exposure has potential for advances in 
plant disease control in addition to soil bio-remediation and the improvement of plant 
performance in agriculture. 

Changes in cadmium concentrations in a plant's surrounding environment results in 
modulation of many genes and gene products. Microarray technology allows monitoring of gene 
expression levels for thousands of genes in a single experiment. This is achieved by 
simultaneously hybridizing two differentially labeled fluorescent cDNA pools to glass slides that 
contain spots of DNA (Schena et al. (1995) Science 270: 467-70). The US Arabidopsis 
Functional Genomics Consortium (AFGC) has recently made public the results from such 
microarray experiments conducted with AFGC chips containing some 1 0,000 non-redundant 
ESTs, selected from about 37,000 randomly sequenced ESTs generated from mRNA of different 
tissues and developmental stages. 

The sequences of the ESTs showing at least two-fold increases or decreases in plants 
treated with 10 jaM cadmium compared with untreated plants were identified, compared to the 
Ceres full length cDNA and genomic sequence databanks, and the equivalent Ceres clones 
identified. The MA diff and/or AFLP__diff MA_diff and/or AFLP_diff and/or AFLP_diff 
table(s) report(s) the results of this analysis, indicating those Ceres clones which are up or down 
regulated over controls, thereby indicating the Ceres clones which represent cadmium responsive 
genes. 

Examples of such cadmium responsive genes and gene products are shown in the 
Reference and Sequence Tables. These genes and/or products are responsible for effects on traits 
such as plant vigor and seed yield. 

While cadmium responsive polynucleotides and gene products can act alone, 
combinations of these polynucleotides also affect growth and development. Useful combinations 
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include different cadmium responsive polynucleotides and/or gene products that have similar 
transcription profiles or similar biological activities, and members of the same or similar 
biochemical pathways. Whole pathways or segments of pathways are controlled by transcription 
factor proteins and proteins controlling the activity of signal transduction pathways. Therefore, 
manipulation of such protein levels is especially useful for altering phenotypes and biochemical 
activities of plants. In addition, the combination of a cadmium responsive polynucleotide and/or 
gene product with other environmentally responsive polynucleotides is also useful because of the 
interactions that exist between, for example, stress and pathogen induced pathways, nutritional 
pathways and development. Here, in addition to polynucleotides having similar transcription 
profiles and/or biological activities, useful combinations include polynucleotides that may have 
different transcription profiles but which participate in common or overlapping pathways. 

The MA diff and/or AFLP_diff MA__diff and/or AFLP_diff and/or AFLP_diff Table(s) 
reports the transcript levels of the experiment (see EXPT ID: Cadium (relating to SMD 7427, 
SMD 7428)). For transcripts that had higher levels in the samples than the control, a "+" is 
shown. A is shown for when transcript levels were reduced in root tips as compared to the 
control. For more experimental detail see the Example section below. 

Cadium genes are those sequences that showed differential expression as compared to 
controls, namely those sequences identified in the MA_diff and/or AFLP_diff MA diff and/or 
AFLP_diff and/or AFLP_diff tables with a or "-" indication 

Cadium Genes Identified By Cluster Analyses Of Differential Expression 

Cadium Genes Identified By Correlation To Genes That Are Differentially 

Expressed 

As described above, the transcription profiles of genes that act together are well 
correlated. Applicants not only have identified the genes that are differentially expressed in the 
microarray experiments, but also have identified the genes that act in concert with them. The 
MA clust table indicates groups of genes that have well correlated transcription profiles and 
therefore participate in the same pathway or network. 

A pathway or network of Cadium genes is any group in the MA clust that 
comprises a cDNA ID that also appears in Expt ID Cadium (relating to SMD 7427, SMD 7428) 
of the MA_diff and/or AFLP_diff MA_diff and/or AFLP_diff and/or AFLP_diff table(s). 
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Cadium Genes Identified By Correlation To Genes That Cause Physiological 

Consequences 

Additionally, the differential expression data and the phenotypic observations can 
be merged to identify pathways or networks of Cadium genes. A group in the MA_clust is 
considered a Cadium pathway or network if the group comprises a cDNA ID that also appears in 
Knock-in or Knock-out tables that causes one or more of the phenotypes described in section 
above. 

Cadium Genes Identified By Amino Acid Sequence Similarity 

Cadium genes from other plant species typically encode polypeptides that share 
amino acid similarity to the sequences encoded by corn and Arabidopsis Cadium genes. Groups 
of Cadium genes are identified in the Protein Group table. In this table, any protein group that 
comprises a peptide ID that corresponds to a cDNA ID member of a Cadium pathway or 
network is a group of proteins that also exhibits Cadium functions/utilities. 

Such cadmium responsive genes and gene products can function to either increase or 
dampen phenotypes or activities either in response to changes in cadmium concentration or in the 
absence of cadmium fluctuations. Further, promoters of cadmium responsive genes, as described 
in the Reference tables, for example, are useful to modulate transcription that is induced by 
cadmium or any of the following phenotypes or biological activities below. 

V.K.I. USE OF CADMIUM RESPONSIVE GENES, GENE COMPONENTS 
AND PRODUCTS TO MODULATE PHENOTYPES 
Cadmium responsive genes and gene products are useful to or modulate one or more of 
the following phenotypes: 
Growth 
Roots 
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Initiation and maintenance of cell division 

• Stems 
Leaves 
Development 

• Mitochondria 

• Post-embryonic root meristem development 

• Senescence 
Stress Response 

Modulation of Jasmonic Acid and other stress control pathways 
Metabolic detoxification 
Heavy metals 

- Plant and Seed Yield; Fruit Yield 

Further, any desired sequence can be transcribed in similar temporal, tissue, or 
environmentally specific patterns as the cadmium responsive genes when the desired 
sequence is operably linked to a promoter of a cadmium responsive gene. 

To regulate any of the phenotype(s) above, activities of one or more of the 
cadmium responsive genes or gene products can be modulated and tested by screening 
for the desired trait. Specifically, the gene, mRNA levels, or protein levels can be altered 
in a plant utilizing the procedures described herein and the phenotypes can be assayed. 
As an example, a plant can be transformed according to Bechtold and Pelletier (1998) 
Methods. Mol. Biol. 82:259-266) and/or screened for variants as in Winkler et al. (1998) 
Plant Physiol 118 : 743-50 and visually inspected for the desired phenotype or 
metabolically and/or functionally assayed according to Ghoshroy et al. (1998, Plant J 13 : 
591-602), Citovsky et al. (1998, Plant J ^6: 13-20), Clemens et al. (1999, EMBO J 18: 
3325-33), Chen et al. (2000, Chemosphere 41 : 229-34), Xian and Oliver (1998, Plant Cell 
10: 1539-90), Romero-Peurtas et al. (1999, Free Rad Res 31: S25-31), Gaur and Noraho 
(1995, Biomed Environ Sci 8: 202-10), Thomine et al. (2000, PNAS USA 97: 4991-6), 
Howden et al. (1995, Plant Physiol 107: 1067-73), Kesseler and Brand (1994, Eur J 
Biochem 225: 907-22) and Vernoux et al. (2000, Plant Cell 12: 97-1 10). 
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V.K.2. USE OF CADMIUM-RESPONSIVE GENES, GENE COMPONENTS 

AND PRODUCTS TO MODULATE BIOCHEMICAL ACTIVITIES 
The activities of one or more of the cadmium responsive genes can be modulated 
to change biochemical or metabolic activities and/or pathways such as those noted below. 
Such biological activities can be measured according to the citations included in the 
Table below: 
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PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS INCLUDING 
ASSAYS 


Growth , 

Differentiation and 
Development 


Root Growth 

Initiation and maintenance of 
cell division 

Resistance to Cadmium- 
inhibition of root growth 


Thomine et al. (2000) PNAS 
USA 97: 4991-6 
Vernoux et al. (2000) Plant Cell 
12: 97-110 


Metabolism 


Cadmium sensing 


Howden et al. (1995) Plant 
Physiol 107: 1067-73 




Cadmium uptake and 
transport 

Decreased cadmium 
transport 

Phytoremediation 


Gaur and Noraho (1995) Biomed 
Environ Sci 8: 202-10 
Thomine et al. (2000) PNAS 
USA 97: 4991-6 




Inhibition of oxidative 
phophorylation 


Kesseler and Brand (1994) Eur. 
Biochem 225: 907-22 


Plant Defenses 


Viral resistance 
Inhibition of systemic 
movement of virus 
Block of viral disease 


Ghoshroy et al. (1998) Plant J 
13: 591-602 




Detoxification of heavy 
metals 


Clemens et al. (1999) EMBO J 
18: 3325-33 




Enhanced stress resistance 


Romero-Peurtas et al. (1999) 
Free Rad Res 31: S25-31 




Cadmium resistance 


Xiang and Oliver (1998) Plant 
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PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS INCLUDING 
ASSAYS 




via modulation of jasmonic 
acid signaling pathway 


Cell 10: 1539-90 


Signaling 


Relief of post-translational 
gene silencing 


Citovsky et al. (1998) Plant J 16: 
13-20 



Other biological activities that can be modulated by the cadmium responsive genes and 
gene products are listed in the Reference tables. Assays for detecting such biological activities 
are described in the Protein Domain table. 

Cadmium responsive genes are characteristically differentially transcribed in response to 
fluctuating cadmium levels or concentrations, whether internal or external to an organism or cell. 
The MA_diff and/or AFLP_diff MAdiff and/or AFLP diff and/or AFLP diff table(s) report(s) 
the changes in transcript levels of various cadmium responsive genes following treatment with 
10 jaM cadmium, relative to untreated plants. 

Profiles of some cadmium responsive genes are shown in the Table below together with 
examples of the kinds of associated biological activities. 



TRANSCRIPT 


TYPE OF 


PHYSIOLOGICAL 


EXAMPLES OF 


LEVELS 


GENES 


CONSEQUENCES 


BIOCHEMICAL 








ACTIVITY 


Up regulated 


Responders to 


Cadmium perception 


Transporters 


transcripts 


cadmium 


Cadmium uptake and 


Metabolic enzymes 




Application 


transport 


Change in cell membrane 






Cadmium metabolism 


structure and potential 




Genes induced by 


Synthesis of secondary 


Kinases and 




cadmium 


metabolites and/or 


Phosphatases 






proteins 


Transcription activators 






Modulation of 


Change in chromatin 






cadmium response 


structure and/or localized 
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TRANSCRIPT 
LEVELS 


TYPE OF 
GENES 


PHYSIOLOGICAL 
CONSEQUENCES 


EXAMPLES OF 
BIOCHEMICAL 
ACTIVITY 






transduction pathways 


DNA topology 






Specific gene 


RNA binding proteins 






transcription initiation 








Genes involved in 








inhibiting systemic 








movement of plant 








viral RNA 








Genes involved in post 








translational gene 








silencing 




Down-regulated 


Responders to 


Negative regulation of 


Transcription factors 


transcripts 


cadmium 


cadmium pathways 


Change in protein 






released 


structure by 




Genes repressed 




phosphorylation 




by cadmium 


Changes in pathways 


(kinases) or 






and processes operating 


Dephosphoryaltion 




Genes with 


in cells 


(phosphatases) 




discontinued 


Changes in metabolism 


Change in chromatin 




expression or 


other than cadmium 


structure and/or DNA 




unsTable mRNA 


pathways 


topology 




in presence of 


Genes involved in 


Factors for protein 




cadmium 


facilitating systemic 


synthesis and degradation 






movement of plant 


Metabolic enzymes 






viral RNA 


RNA binding proteins 






Genes involved in 








promoting post 








translational gene 








silencing 
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Use of Promoters of Cadmium Responsive Genes 

Promoters of Cadmium responsive genes are useful for transcription of any desired 
polynucleotide or plant or non-plant origin. Further, any desired sequence can be transcribed in 
a similar temporal, tissue, or environmentally specific patterns as the Cadmium responsive genes 
where the desired sequence is operably linked to a promoter of a Cadmium responsive gene. The 
protein product of such a polynucleotide is usually synthesized in the same cells, in response to 
the same stimuli as the protein product of the gene from which the promoter was derived. Such 
promoter are also useful to produce antisense mRNAs to down-regulate the product of proteins, 
or to produce sense mRNAs to down-regulate mRNAs via sense suppression. 
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V.L. DISEASE RESPONSIVE GENES, GENE COMPONENTS AND PRODUCTS 
Often growth and yield are limited by the ability of a plant to tolerate stress conditions, 
including pathogen attack. To combat such conditions, plant cells deploy a battery of inducible 
defense responses, including the triggering of an oxidative burst and the transcription of 
pathogenesis-related protein (PR protein) genes. These responses depend on the recognition of a 
microbial avirulence gene product (avr) by a plant resistance gene product (R), and a series of 
downstream signaling events leading to transcription- independent and transcription-dependent 
disease resistance responses. Reactive oxygen species (ROS) such as H2O2 and NO from the 
oxidative burst plays a signaling role, including initiation of the hypersensitive response (HR) and 
induction of systemic acquired resistance (SAR) to secondary infection by unrelated pathogens. PR 
proteins are able to degrade the cell walls of invading microorganisms, and phytoalexins are directly 
microbicidal. 

The presence of an avirulent pathogen and/or changes in the concentrations of O2", H2O2 
and NO in the environment surrounding a plant cell modulate the activities of many genes and, 
therefore, the levels of many gene products. Examples of tobacco mosaic virus (TMV) 
responsive genes and gene products, many of them operating through an ROS signaling system, 
are shown in The Reference and Sequence Tables. These genes and/or products are responsible 
for effects on traits such as plant vigor and seed yield. The genes were discovered and 
characterized from a much larger set by experiments designed to find genes whose mRNA 
products changed in response to application of TMV to plants. 

Microarray technology allows monitoring of gene expression levels for thousands of 
genes in a single experiment. This is achieved by hybridizing labeled fluorescent cDNA pools to 
glass slides that contain spots of DNA (Schena et al. (1995) Science 270: 467-70). The US 
Arabidopsis Functional Genomics Consortium (AFGC) has recently made public the results from 
such microarray experiments conducted with AFGC chips containing some 1 0,000 non- 
redundant ESTs, selected from about 37,000 randomly sequenced ESTs generated from mRNA 
of different tissues and developmental stages. 

The sequences of the ESTs showing at least two-fold increases or decreases in response 
to TMV infection over the non infected controls were identified, compared to the Ceres full 
length cDNA and genomic sequence databanks, and equivalent Ceres clones identified. The 
MA_diff and/or AFLP_diff MA^diff and/or AFLP__diff and/or AFLP_diff table(s) report(s) the 
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results of this analysis, indicating those Ceres clones which are up or down regulated over 
controls, thereby indicating the Ceres clones which represent disease responsive genes. 

Manipulation of one or more disease responsive gene activities are useful to modulate the 
biological processes and/or phenotypes listed below. Disease responsive genes and gene products 
5 can act alone or in combination. Useful combinations include disease responsive genes and/or 
gene products with similar transcription profiles, similar biological activities, or members of the 
same or functionally related biochemical pathways. Whole pathways or segments of pathways 
are controlled by transcription factor proteins and proteins controlling the activity of signal 
transduction pathways. Therefore, manipulation of such protein levels is especially useful for 
10 altering phenotypes and biochemical activities of plants. 

Such disease responsive genes and gene products can function to either increase or 
dampen the above phenotypes or activities either in response to changes in active oxygen 
concentration or in the absence of active oxygen fluctuations. The MA_diff and/or AFLP_diff 

en 

iji i MAdiff and/or AFLP_diff and/or AFLP diff Table(s) reports the transcript levels of the 
% experiment (see EXPT ID: Disease (relating to SMD 7342, SMD 7343)). For transcripts that had 
O higher levels in the samples than the control, a "+" is shown. A "-" is shown for when transcript 
levels were reduced in root tips as compared to the control. For more experimental detail see the 
+ Example section below. 

jU Disease genes are those sequences that showed differential expression as compared to 

20 controls, namely those sequences identified in the MA diff and/or AFLP diff MA diff and/or 
AFLPdiff and/or AFLP_diff tables with a "+" or "-" indication 

Disease Genes Identified By Cluster Analyses Of Differential Expression 

Disease Genes Identified By Correlation To Genes That Are Differentially 

25 Expressed 

As described above, the transcription profiles of genes that act together are well 
correlated. Applicants not only have identified the genes that are differentially expressed in the 
microarray experiments, but also have identified the genes that act in concert with them. The 
MA clust table indicates groups of genes that have well correlated transcription profiles and 
30 therefore participate in the same pathway or network. 
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A pathway or network of Disease genes is any group in the MA_clust that 
comprises a cDNA ID that also appears in Expt ID Disease (relating to SMD 7342, SMD 7343) 
of the MA_diff and/or AFLP„diff MA_diff and/or AFLP_diff and/or AFLP_diff table(s). 

5 Disease Genes Identified By Correlation To Genes That Cause Physiological 

Consequences 

Additionally, the differential expression data and the phenotypic observations can 
be merged to identify pathways or networks of Disease genes. A group in the MA_clust is 
considered a Disease pathway or network if the group comprises a cDNA ID that also appears in 
10 Knock-in or Knock-out tables that causes one or more of the phenotypes described in section 
above. 



|j! Disease Genes Identified By Amino Acid Sequence Similarity 

m 

ify Disease genes from other plant species typically encode polypeptides that share 

; i45 amino acid similarity to the sequences encoded by corn and Arabidopsis Disease genes. Groups 
[g of Disease genes are identified in the Protein Group table. In this table, any protein group that 
^ | comprises a peptide ID that corresponds to a cDNA ID member of a Disease pathway or network 
; ;|5 is a group of proteins that also exhibits Disease functions/utilities. 

Further, promoters of disease responsive genes, as described in the Reference tables, for 
20 example, are useful to modulate transcription that is induced by disease or any of the following 
phenotypes or biological activities below. Further, any desired sequence can be transcribed in 
similar temporal, tissue, or enviromentally specific patterns as the disease responsive genes when 
the desired sequence is operably linked to a promoter of a disease responsive gene. 

25 V.L.I. USE OF DISEASE RESPONSIVE GENES, GENE COMPONENTS 

AND PRODUCTS TO MODULATE PHENOTYPES 
Disease responsive genes and gene products are useful to or modulate one or more of the 
following phenotypes: 

• Pathogen Tolerance and/or Resistance 
30 • Avr/R locus interactions 

♦ Non-Host interactions 
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• HR 

. SAR 

Resistance to bacteria e.g. to Erwinia stewartii, Pseudomonas syringae, Pseudomonas 
tabaci, Stuart's wilt, etc. 

• Resistance to fungi e.g. to downy mildews such as Scleropthora macrospora, 
Sclerophthora rayissiae, Sclerospora graminicola, Peronosclerospora sorghi, 
Peronosclerospora philippinensis, Peronosclerospora sacchari, Peronosclerospora 
maydis; rusts such as Puccinia sorphi, Puccinia polysora, Physopella zeae, etc.; and to 
other fungal diseases e.g. Cercospora zeae-maydis, Colletotrichum graminicola, 
Fusarium monoliforme, Exserohilum turcicum, Bipolaris maydis, Phytophthora 
parasitica, Peronospora tabacina, Septoria, etc.; 

Resistance to viruses or viroids e.g. to tobacco or cucumber mosaic virus, ringspot 
virus, necrosis virus, pelargonium leaf curl virus, red clover mottle virus, tomato 
bushy stunt virus, and like viruses; 

• Resistance to insecs, such as to aphids e.g. Myzus persicae; to beetles and beetle 
larvae; to lepidoptera larvae e.g. Heliothus etc. 

• Resistance to Nematodes, e.g. Meloidogyne incognita etc 

Local resistance in primary (infected) or secondary (uninfected) leaves 
Stress Tolerance 

• Winter Survival 
Cold Tolerance 

• Salt tolerance 

• Heavy Metal Tolerance, such as Cadmium 

• Tolerance to Physical Wounding; 

• Increased Organelle Tolerance to Redox Stress, such as in Mitochondria, and 
chloroplasts 

. Cell Death 

• Programmed cell death, including death of diseased tissue and during senescence 
Fruit Drop 

Biomass 

Fresh and Dry Weight during any time in plant life, such as maturation 
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Number of Flowers, Seeds, Branches, and/or Leaves 

Seed Yield, including Number, Size, Weight, and/or Harvest Index 

Fruit Yield, including Number, Size, Weight, and/or Harvest Index 

Plant Development 

Time to Fruit Maturity 

Cell Wall Strengthening and Reinforcement 

Plant Product Quality 

Paper making quality 

Food additives 

Treatment of Indications modulated by Free Radicals 
Cancer 

Kinds of low molecular weight compounds such as phytoalexins 
Abundance of low molecular weight compounds such as phytoalexins 
Other phenotypes based on gene silencing 



5 M 

m 
m 



20 



25 



To regulate any of the phenotype(s) above, activities of one or more of the disease 
responsive genes or gene products can be modulated and the plants can be tested by screening for 
the desired trait. Specifically, the gene, mRNA levels, or protein levels can be altered in a plant 
utilizing the procedures described herein and the phenotypes can be screened for variants as in 
Winkler et al. (1998) Plant Physiol 118: 743-50 and assayed , for example, in accordance to Alvarez 
et al., (1998) Cell 92: 773-784; Halhbrock and Scheel, (1989) Ann. Rev. Plant Physiol. Plant Mol. 
Biol. 40: 347-369; Lamb et al., (1997) Ann. Rev. Plant Mol. Biol. Plant Physiol. 48: 251-275; 
Lapwood et al. (1984) Plant Pathol. 33: 13-20; Levine et al. (1996) Curr. Biol. 6: 427-437; 
McKersie et al., (2000) Plant Physiol. 122: 1427-1437; Olson and Varner (1993) Plant J. 4: 887- 
892; Pastore et al., (2000), FEBS Lett 470: 88-92; Pastori et aL, (1997) Plant Physiol. 1 13: 41 1-418; 
Romero-Puertas et al., (1999) Free Radic. Res. 1999 31 Suppl: S25-31; Shirataki et aL, Anticancer 
Res 20: 423-426 (2000); Wu et al., (1995) Plant Cell 7: 1357-1368. 



30 



V.L.2. USE OF DISEASE RESPONSIVE GENES, GENE COMPONENTS 
AND PRODUCTS TO MODULATE BIOCHEMICAL ACTIVITIES 
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The activities of one or more of the disease responsive genes can be modulated to 
change biochemical or metabolic activities and/or pathways such as those noted below. 
Such biological activities are documented and can be measured according to the citations 
above and included in the Table below: 



PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS INCLUDING 
ASSAYS 


Resistance to Pathogens 


Induction of ROS signaling 
pathways 


Wu et.al.(1995) Plant Cell 7: 
1357-68 




Modulation of nitric oxide 
signaling 


Delledonne et al. (1998) Nature 
394: 585-588 




Induction of PR proteins, 
phytoalexins, and defense 
pathways 


Chamnongpol et.al.(1998) Proc. 
Nat.Acad Sci USA 12;95:5818- 
23. 

Davis etal. (1993) 
Phytochemistry 32: 607-611 




Induction of cellular 
protectant genes such as 
glutathione S-transferase 
(GST) and ascorbate 
peroxidase 


Chen et.al. Plant J. (1996) 
10:955-966 

Gadea et.al.(1999) Mol Gen 

Genet 262:212-219 

Wu et.al.(1995) Plant Cell 7: 

1357-68 




ROS levels following 
wounding and changes in 
physical pressure 


Orozco-Cardenas and Ryan 

(1999) Proc.Nat. Acad. Sci. USA 

25;96:6553-7. 

Yahraus et al. (1995) Plant 

Physiol. 109: 1259-1266 
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PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS INCLUDING 
ASSAYS 




Salicyclic acid levels and 
signaling 


Durner and Klessig (1996) 
J.Biol.Chem. 271:28492-501 


Responses to Wounding 


Expression of genes Involved 
in wound repair and cell 
division 


Legendre et al. (1993) Plant 
Physiol. 102: 233-240 


Responses to Environmental 
Stress 


Expression of genes involved 
in responses to drought, cold, 
salt, heavy metals 


Shi et al. (2000) Proc. Natl. Acad. 
Sci.USA 97:6896-6901 


Reinforcement of Cell Walls 


Modulation of the Production 
of ExtracTable Proline-Rich 
Protein 


Bradley et al. (1992) Cell 70, 21- 
30 




Modulation of Lignification 


Mansouri et al. (1999) Physiol. 
Plant 106: 355-362 


Programmed Cell Death 


Induction of PCD activating 
genes 


Levine et al. (1996) Curr. Biol. 6: 
427-437. Reynolds etal. (1998) 
Biochem.J. 330:115-20 




Suppression of PCD 
suppressing genes 


Pennell and Lamb (1997) Plant 
Cell 9, 1157-1168 



Other biological activities that can be modulated by the disease responsive genes and 
their products are listed in the Reference Table. Assays for detecting such biological activities 
are described in the Protein Domain table. 

Disease responsive genes are characteristically differentially transcribed in response to 
fluctuating levels of disease. The MA_diff and/or AFLP_diff MA__diff and/or AFLP_diff and/or 
AFLP_diff table(s)report(s) the changes in transcript levels of various disease responsive genes 
in the aerial parts of a plant 3 days after the plant was sprayed with a suspension of TMV relative 
to control plants sprayed with water. 
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The data from this experiment reveal a number of types of disease responsive genes and 
gene products, including "early responders," and "delayed responders". Profiles of individual 
disease responsive genes are shown in the Table below with examples of which associated 
biological activities are modulated when the activities of one or more such genes vary in plants. 
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GENE 

EXPRESSION 
LEVELS 


FUNCTIONAL 
CATEGORY 
OF GENE 


PHYSIOLOGICAL 
CONSequence 


EXAMPLES OF 
BIOCHEMICAL 
ACTIVITY 

OF GENE PRODUCTS 


Upregulated 
transcripts 


Early 

Responders to 
Pathogens 


ROS Perception and 
Response 


Transcription factors, 
kinases, phosphatases, GTP- 
binding proteins (G- 
proteins), leucine rich repeat 
proteins (LRRs), 
transporters, calcium 
binding proteins, chromatin 
remodeling proteins 






Initiation of Gene 
Transcription 


Glutathione S-transferase 
(GST), 

heat shock proteins, 
salicylic acid (SA) response 
pathway proteins, jasmonate 
response pathway proteins, 
dehydrins, peroxidases, 
catalases 




Delayed 
Responders to 
Pathogens 


Initiation of Defence 
Gene Transcription 


Proteases, pathogen 
response (PR) proteins, 
cellulases, chitinases, 
cutinases, glucanases, other 
degrading enzymes, calcium 
channel blockers, 
phenylalanine ammonia 
lyase, proteins of defense 
pathways, cell wall proteins 
incuding proline rich 
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proteins and glycine rich 
proteins, epoxide hydrolase, 
methyl transferases 






Activation of cell death 
pathways 


Transcription factors 
kinases, phosphatases, DNA 
surveillance proteins, p53, 
proteases, endonucleases, 
GTP-binding proteins (G- 
proteins), leucine rich repeat 
proteins (LRRs), 
transporters, calcium 
binding proteins, 
mitochondrial and 
chloroplast energy related 
proteins, ribosome 
inactivating proteins 






Initiation of Cellular 
Protectant Gene 
Transcription 


Reactive oxygen scavenging 
enzymes, GST, catalase, 
peroxidase, ascorbate 
oxidase 


Downregulated 
transcripts 


Early 

responders to 
pathogens 


Negative regulation of 
pathogen inducible 
pathways released 


Transcription factors, 
kinases, phosphatases, GTP- 
binding proteins (G- 
proteins), leucine rich repeat 
proteins (LRRs), 
transporters, calcium 
binding proteins, chromatin 
remodelling proteins 




Genes repressed 
by TMV 


Negative regulation of 
ROS inducible 


Transcription factors, 
kinases, phosphatases, GTP- 
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pathways released 


binding proteins (G- 
proteins), leucine rich repeat 
proteins (LRRs), 
transporters, calcium 
binding proteins, chromatin 
remodelling proteins 




Delayed 
Responders to 
Pathogens 


Negative regulation of 
pathogen inducible 
pathways released 


Transcription factors, 
kinases, phosphatases, GTP- 
binding proteins (G- 
proteins), leucine rich repeat 
proteins (LRRs), 
transporters, calcium 
binding proteins, chromatin 
remodelling proteins 




Genes repressed 
by TMV 


Negative regulation of 
genes suppressing 
programmed cell death 
released 


Transcription factors, 
kinases, phosphatases, GTP- 
binding proteins (G- 
proteins), leucine rich repeat 
proteins (LRRs), 
transporters, calcium 
binding proteins, chromatin 
remodelling proteins 



Use of Promoters of Disease Responsive Genes 

Promoters of Disease responsive genes are useful for transcription of any desired 
polynucleotide or plant or non-plant origin. Further, any desired sequence can be transcribed in 
a similar temporal, tissue, or environmentally specific patterns as the Disease responsive genes 
where the desired sequence is operably linked to a promoter of a Disease responsive gene. The 
protein product of such a polynucleotide is usually synthesized in the same cells, in response to 
the same stimuli as the protein product of the gene from which the promoter was derived. Such 
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promoter are also useful to produce antisense mRNAs to down-regulate the product of proteins, 
or to produce sense mRNAs to down-regulate mRNAs via sense suppression. 
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V.M. DEFENSE (LQL2) RESPONSIVE GENES, GENE COMPONENTS AND 
PRODUCTS 

Often growth and yield are limited by the ability of a plant to tolerate stress conditions, 
including pathogen attack. To combat such conditions, plant cells deploy a battery of inducible 
defense responses, including the triggering of an oxidative burst and the transcription of 
pathogenesis-related protein (PR protein) genes. Reactive oxygen species (ROS) such as H2O2 and 
NO from the oxidative burst play a signaling role, including initiation of the hypersensitive response 
(HR) and induction of systemic acquired resistance (S AR) to secondary infection by unrelated 
pathogens. Some PR proteins are able to degrade the cell walls of invading microorganisms, and 
phytoalexins are directly microbicidal. Other defense related pathways are regulated by salicylic 
acid (SA) or methyl jasmonate (Me J). 

These responses depend on the recognition of a microbial avirulence gene product (avr) by a 
plant resistance gene product (R), and a series of downstream signaling events leading to 
transcription- independent and transcription-dependent disease resistance responses. Current 
models suggest that R- gene-encoded receptors specifically interact with pathogen-encoded ligands 
to trigger a signal transduction cascade. Several components include ndrl and edsl loci. NDR1, 
EDS 1 3 PR1 , as well as PDF 1 .2, a Me J regulated gene and Niml , a SA regulated gene, are 
differentially regulated in plants with mutations in the LOL2 gene. 

LOL2 shares a novel zinc finger motif with LSD1, a negative regulator of cell death and 
defense response. Due to an alternative splice site the LOL2 gene encodes two different proteins, 
one of which contains an additional, putative DNA binding motif Northern analysis 
demonstrated that LOL2 transcripts containing the additional DNA binding motif are 
predominantly upregulated after treatment with both virulent and avirulent Pseudomonas 
syringae pv maculicola strains. Modulation in this gene can also confer enhanced resistance to 
virulent and avirulent Peronospora parasitica isolates 

Examples of LOL2 responsive genes and gene products are shown in the Reference, 
Sequence, Protein Group, Protein Group Matrix, MA_diff and/or AFLP_diff MA_diff and/or 
AFLP_diff and/or AFLP diff and MA clust tables. These genes and/or products are 
responsible for effects on traits such as plant vigor, disease resistance, and seed yield. The genes 
were discovered and characterized from a much larger set by microarray experiments designed to 
find genes whose mRNA products changed when the LOL2 gene was mutated in plants. 
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Microarray technology allows monitoring of gene expression levels for thousands of 
genes in a single experiment. This is achieved by hybridizing labeled fluorescent cDNA pools to 
glass slides that contain spots of DNA (Schena et al. (1995) Science 270: 467-70). The US 
Arabidopsis Functional Genomics Consortium (AFGC) has recently made public the results from 
5 such microarray experiments conducted with AFGC chips containing some about 10,000 non- 
redundant ESTs, selected from about 37,000 randomly sequenced ESTs generated from mRNA 
of different tissues and developmental stages. 

The sequences of the ESTs showing at least two-fold increases or decreases in plants with 
the LOL2 mutation versus wildtype were obtained. Specifically, the plant line lol-2-2 tested, a 
10 loss of function mutation. The ESTs were compared to the Ceres full length cDNA and genomic 
•jp sequence databanks, and equivalent Ceres clones identified. The MA_diff and/or AFLP_diff 
:;^J MA_diff and/or AFLP_diff and/or AFLP_diff table reports the results of this analysis, 
^1 indicating those Ceres clones which are up or down regulated over controls, thereby indicating 

en 

Ijlj the Ceres clones which represent LOL2 responsive genes. 

'^5 Manipulation of one or more LOL2 responsive gene activities are useful to modulate the 

•P biological processes and/or phenotypes listed below. LOL2 responsive genes and gene products 
|y can act alone or in combination. Useful combinations include LOL2 responsive genes and/or 

gene products with similar transcription profiles, similar biological activities, or members of the 
M same or functionally related biochemical pathways. Whole pathways or segments of pathways 
20 are controlled by transcription factor proteins and proteins controlling the activity of signal 

transduction pathways. Therefore, manipulation of such protein levels is especially useful for 

altering phenotypes and biochemical activities of plants. 

Such LOL2 responsive genes and gene products can function to either increase or 

dampen the above phenotypes or activities either in response to changes in active LOL2 gene or 
25 in the absence. The MA_diff and/or AFLP_diff MA_diff and/or AFLP_diff and/or AFLP_diff 

Table(s) reports the transcript levels of the experiment (see EXPT ID: lol2 (relating to SMD 

8031, SMD 8032)). For transcripts that had higher levels in the samples than the control, a "+" 

is shown. A "-" is shown for when transcript levels were reduced in root tips as compared to the 

control. For more experimental detail see the Example section below. 
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Defense genes are those sequences that showed differential expression as compared to 
controls, namely those sequences identified in the MA_diff and/or AFLP_diff MA_diff and/or 
AFLP_diff and/or AFLP_diff tables with a "+" or "-" indication 

Defense Genes Identified By Cluster Analyses Of Differential Expression 

Defense Genes Identified By Correlation To Genes That Are Differentially 

Expressed 

As described above, the transcription profiles of genes that act together are well 
correlated. Applicants not only have identified the genes that are differentially expressed in the 
microarray experiments, but also have identified the genes that act in concert with them. The 
MA clust table indicates groups of genes that have well correlated transcription profiles and 
therefore participate in the same pathway or network. 

A pathway or network of Defense genes is any group in the MA_clust that 
comprises a cDNA ID that also appears in Expt ID lol2 (relating to SMD 8031, SMD 8032) of 
the MA diff and/or AFLP_diff MA_diff and/or AFLP_diff and/or AFLP_diff table(s). 

Defense Genes Identified By Correlation To Genes That Cause Physiological 

Consequences 

Additionally, the differential expression data and the phenotypic observations can 
be merged to identify pathways or networks of Defense genes. A group in the MA_clust is 
considered a Defense pathway or network if the group comprises a cDNA ID that also appears in 
Knock-in or Knock-out tables that causes one or more of the phenotypes described in section 
above. 

Defense Genes Identified By Amino Acid Sequence Similarity 

Defense genes from other plant species typically encode polypeptides that share 
amino acid similarity to the sequences encoded by corn and Arabidopsis Defense genes. Groups 
of Defense genes are identified in the Protein Group table. In this table, any protein group that 
comprises a peptide ID that corresponds to a cDNA ID member of a Defense pathway or 
network is a group of proteins that also exhibits Defense functions/utilities. 
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Further, promoters of LOL2 responsive genes, as described in the Reference tables, for 
example, are useful to modulate transcription that is induced by LOL2 responsive genes or any 
of the following phenotypes or biological activities below. Further, any desired sequence can be 
transcribed in similar temporal, tissue, or enviromentally specific patterns as the LOL2 
5 responsive genes when the desired sequence is operably linked to a promoter of a LOL2 
responsive gene. 



w 

m 
m 



lit 
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V.M. 1 . USE OF LOL2 RESPONSIVE GENES, GENE COMPONENTS AND 
PRODUCTS TO MODULATE PHENOTYPES 

Lol2 Responsive Genes And Gene Products Are Useful To Or Modulate One Or More Of 
5 The Following Phenotypes: 

Pathogen Tolerance and/or Resistance 
Avr/R locus interactions 
Non-Host interactions 

• HR 

10 • SAR, e.g., disease responsive genes acting in conjunction with infection with any of 

the organisms listed below 

Resistance to bacteria e.g. to Erwinia stewartii, Pseudomonas syringae, Pseudomonas 
|jj tabaci, Stuart's wilt, etc. 

Resistance to fungi e.g. to downy mildews such as Scleropthora macrospora, 
Sclerophthora rayissiae, Sclerospora graminicola, Peronosclerospora sorghi, 
Peronosclerospora philippinensis, Peronosclerospora sacchari, Peronosclerospora 
maydis; rusts such as Puccinia sorphi, Puccinia polysora, Physopella zeae, etc.; and to 
other fungal diseases e.g. Cercospora zeae-maydis, Colletotrichum graminicola, 
*: Fusarium monoliforme, Exserohilum turcicum, Bipolaris maydis, Phytophthora 

20 parasitica, Peronospora tabacina, Septoria, etc.; 

Resistance to viruses or viroids e.g. to tobacco or cucumber mosaic virus, ringspot 
virus, necrosis virus, pelargonium leaf curl virus, red clover mottle virus, tomato 
bushy stunt virus, and like viruses; 

♦ Resistance to insecs, such as to aphids e.g. Myzus persicae; to beetles and beetle 
25 larvae; to lepidoptera larvae e.g. Heliothus etc. 

Resistance to Nematodes, e.g. Meloidogyne incognita etc 
Local resistance in primary (infected) or secondary (uninfected) leaves 
Stress Tolerance 
Winter Survival 
Cold Tolerance 
Salt tolerance 



30 
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• Heavy Metal Tolerance, such as Cadmium 

• Tolerance to Physical Wounding; 

• Increased Organelle Tolerance to Redox Stress, such as in Mitochondria, and 
chloroplasts 

5 • Cell Death 

Programmed cell death, including death of diseased tissue and during senescence 
Fruit Drop 

• Biomass 

• Fresh and Dry Weight during any time in plant life, such as maturation 

10 . Number of Flowers, Seeds, Branches, and/or Leaves 

ft— 

• Seed Yield, including Number, Size, Weight, and/or Harvest Index 

'° • Fruit Yield, including Number, Size, Weight, and/or Harvest Index 

yj 

If! • Plant Development 

ijji 

i • Time to Fruit Maturity 

H i5 • Cell Wall Strengthening and Reinforcement 

O • Plant Product Quality 

KB 

• Paper making quality 

• Food additives 

iM* ♦ Treatment of Indications modulated by Free Radicals 

20 . Cancer 

• Kinds of low molecular weight compounds such as phytoalexins 
Abundance of low molecular weight compounds such as phytoalexins 
Other phenotypes based on gene silencing 

25 To regulate any of the phenotype(s) above, activities of one or more of the LOL2 

responsive genes or gene products can be modulated and the plants can be tested by screening for 
the desired trait. Specifically, the gene, mRNA levels, or protein levels can be altered in a plant 
utilizing the procedures described herein and the phenotypes can be screened for variants as in 
Winkler et al. (1998) Plant Physiol 118: 743-50 and assayed , for example, in accordance to 

30 Alvarez et al., (1998) Cell 92: 773-784; Halhbrock and Scheel, (1989) Ann. Rev. Plant Physiol. 
Plant Mol. Biol. 40: 347-369; Lamb et al., (1997) Ann. Rev. Plant Mol. Biol. Plant Physiol. 48: 
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251-275; Lapwood et al. (1984) Plant Pathol. 33: 13-20; Levine et al. (1996) Curr. Biol. 6: 427- 
437; McKersie et al., (2000) Plant Physiol. 122: 1427-1437; Olson and Varner (1993) Plant J. 4: 
887-892; Pastore et al., (2000), FEBS Lett 470: 88-92; Pastori et al., (1997) Plant Physiol. 113: 
41 1-418; Romero-Puertas et al., (1999) Free Radic. Res. 1999 31 Suppl: S25-31; Shirataki et al., 
Anticancer Res 20: 423-426 (2000); Wu et al., (1995) Plant Cell 7: 1357-1368. 

V.M.2.USE OF DEFENSE RESPONSIVE GENES TO MODULATE 
BIOCHEMICAL ACTIVITIES 

The activities of one or more of the defense (LOL2) responsive genes can be 
modulated to change biochemical or metabolic activities and/or pathways such as those 
noted below. Such biological activities are documented and can be measured according to 
the citations above and included in the Table below: 



PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS INCLUDING 
ASSAYS 


Resistance To Pathogens 


Induction Of ROS Signaling 
Pathways 


Wu et.al.(1995) Plant Cell 7: 
1357-68 




Modulation Of Nitric Oxide 
Signaling 


Delledonne et al. (1998) Nature 
394: 585-588 




Induction Of PR Proteins, 
Phytoalexins, And Defense 
Pathways 


Chamnongpol et.al.(1998) Proc. 
Nat.Acad Sci USA 12;95:5818-23. 
Davis et al. (1993) Phytochemistry 
32: 607-611 




Induction Of Cellular 
Protectant Genes Such As 
Glutathione S-Transferase 
(GST) And Ascorbate 
Peroxidase 


Chen etal. Plant J. (1996) 10:955- 
966 

Gadea et.al.(1999) Mol Gen Genet 
262:212-219 

Wu et.al.(1995) Plant Cell 7: 
1357-68 
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PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS INCLUDING 
ASSAYS 




ROS Levels Following 
Wounding And Changes In 
Physical Pressure 


Orozco-Cardenas and Ryan (1999) 

Proc.Nat. Acad. Sci. USA 

25;96:6553-7. 

Yahraus et al. (1995) Plant 

Physiol. 109: 1259-1266 




Salicyclic Acid Levels And 
Signaling 


Durner and Klessig (1996) 
J.Biol.Chem. 271:28492-501 


Responses To Wounding 


Expression Of Genes Involved 
In Wound Repair And Cell 
Division 


Legendre et al. (1993) Plant 
Physiol. 102: 233-240 


Responses To 
Environmental Stress 


Expression Of Genes Involved 
In Responses To Drought, 
Cold, Salt, Heavy Metals 


Shi et al. (2000) Proc. Natl. Acad. 
Sci. USA 97:6896-6901 


Reinforcement Of Cell 
Walls 


Modulation Of The Production 
Of Extrac Table Proline-Rich 
Protein 


Bradley et al. (1992) Cell 70, 21- 
30 




Modulation Of Lignification 


Mansouri et al. (1999) Physiol. 
Plant 106:355-362 


Programmed Cell Death 


Induction Of Pcd Activating 
Genes 


Levine et al. (1996) Curr. Biol. 6: 
427-437. Reynolds et.al. (1998) 
Biochem.J. 330:115-20 




Suppression Of PCD 
Suppressing Genes 


Pennell and Lamb (1997) Plant 
Cell 9, 1157-1168 



Other biological activities that can be modulated by the LOL2 responsive genes and their 
products are listed in the Reference tables. Assays for detecting such biological activities are 
described in the Protein Domain table. 
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L0L2 responsive genes are characteristically differentially transcribed in response to 
fluctuating levels of disease. MA__diff and/or AFLP_diff MA_diff and/or AFLP_diff and/or 
AFLPdiff table reports the changes in transcript levels of various LOL2 responsive genes in the 
lol-2 line versus control plants. 

The data from this experiment reveal a number of types of LOL2 responsive genes and 
gene products. Profiles of individual LOL2 responsive genes are shown in the Table below with 
examples of which associated biological activities are modulated when the activities of one or 
more such genes vary in plants. 



GENE 

EXPRESSION 
LEVELS 


FUNCTIONAL 
CATEGORY 
OF GENE 


PHYSIOLOGICAL 
CONSequence 


EXAMPLES OF 
BIOCHEMICAL 
ACTIVITY 

OF GENE PRODUCTS 


Upregulated 
transcripts 


Early 

Responders to 
the LOL2 
Mutation 


ROS Perception and 
Response 


Transcription factors, 
kinases, phosphatases, GTP- 
binding proteins (G- 
proteins), leucine rich repeat 
proteins (LRRs), 
transporters, calcium 
binding proteins, chromatin 
remodeling proteins 






Initiation of Gene 
Transcription 


Glutathione S-transferase 
(GST), 

heat shock proteins, 
salicylic acid (SA) response 
pathway proteins, jasmonate 
response pathway proteins, 
dehydrins, peroxidases, 
catalases 




Delayed 
Responders to 
the LOL2 


Initiation of Defence 
Gene Transcription 


Proteases, pathogen 
response (PR) proteins, 
cellulases, chitinases, 
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GENE 


FUNCTIONAL 


PHYSIOLOGICAL 


EXAMPLES OF 


EXPRESSION 


CATEGORY 


CONSequence 


BIOCHEMICAL 


LEVELS 


OF GENE 




ACTIVITY 








OF GENE PRODUCTS 




Mutation 




cutinases, glucanases, other 








degrading enzymes, calcium 








channel blockers, 








phenylalanine ammonia 








lyase, proteins of defense 








pathways, cell wall proteins 








incuding proline rich 








proteins and glycine rich 








proteins, epoxide hydrolase, 








methyl transferases 






Activation of cell death 


Transcription factors 






pathways 


kinases, phosphatases, DNA 








surveillance proteins, p53, 








proteases, endonucleases, 








GTP-binding proteins (G- 








proteins). leucine rich reneat 








proteins (LRRs), 








transporters, calcium 








binding proteins, 








mitochondrial and 








chloroplast energy related 








proteins, ribosome 








inactivating proteins 






Initiation of Cellular 


Reactive oxygen scavenging 






Protectant Gene 


enzymes, GST, catalase, 






Transcription 


peroxidase, ascorbate 
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GENE 

EXPRESSION 
LEVELS 


FUNCTIONAL 
CATEGORY 
OF GENE 


PHYSIOLOGICAL 
CONSequence 


EXAMPLES OF 
BIOCHEMICAL 
ACTIVITY 

OF GENE PRODUCTS 








oxidase 


Downregulated 
transcripts 


Early 

Responders to 
the LOL2 
Mutation 


Negative regulation of 
LOL2 Mutation 
inducible pathways 
released 


Transcription factors, 
kinases, phosphatases, GTP- 
binding proteins (G- 
proteins), leucine rich repeat 
proteins (LRRs), 
transporters, calcium 
binding proteins, chromatin 
remodelling proteins 




Genes 

Repressed by 
the LOL2 
Mutation 


Negative regulation of 
ROS inducible 
pathways released 


Transcription factors, 
kinases, phosphatases, GTP- 
binding proteins (G- 
proteins), leucine rich repeat 
proteins (LRRs), 
transporters, calcium 
binding proteins, chromatin 
remodelling proteins 




Delayed 
Responders to 
the LOL2 
Mutation 


Negative regulation of 
LOL2 Mutation 
inducible pathways 
released 


Transcription factors, 
kinases, phosphatases, GTP- 
binding proteins (G- 
proteins), leucine rich repeat 
proteins (LRRs), 
transporters, calcium 
binding proteins, chromatin 
remodelling proteins 




Genes 


Negative Regulation Of 


Transcription Factors, 
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GENE 

EXPRESSION 
LEVELS 


FUNCTIONAL 
CATEGORY 
OF GENE 


PHYSIOLOGICAL 
CONSequence 


EXAMPLES OF 
BIOCHEMICAL 
ACTIVITY 

OF GENE PRODUCTS 




Repressed By 
The LOL2 
Mutation 


Genes Suppressing 
Programmed Cell 
Death Released 


Kinases, Phosphatases, 
GTP-Binding Proteins (G- 
Proteins), Leucine Rich 
Repeat Proteins (Lrrs), 
Transporters, Calcium 
Binding Proteins, 
Chromatin Remodelling 
Proteins 



Use of Promoters of Defense Responsive Genes 
■ ^ Promoters of Defense responsive genes are useful for transcription of any desired 

=jg polynucleotide or plant or non-plant origin. Further, any desired sequence can be transcribed in 
^5 a similar temporal, tissue, or environmentally specific patterns as the Defense responsive genes 
p where the desired sequence is operably linked to a promoter of a Defense responsive gene. The 
protein product of such a polynucleotide is usually synthesized in the same cells, in response to 
the same stimuli as the protein product of the gene from which the promoter was derived. Such 
promoter are also useful to produce antisense mRNAs to down-regulate the product of proteins, 
10 or to produce sense mRNAs to down-regulate mRNAs via sense suppression. 
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V.N. IRON RESPONSIVE GENES, GENE COMPONENTS AND PRODUCTS 
Iron (Fe) deficiency in humans is the most prevalent nutritional problem worldwide 
today. Increasing iron availability via diet is a sustainable malnutrition solution for many of the 
world's nations. One-third of the world's soils, however, are iron deficient. Consequently, to 
form a food-based solution to iron malnutrition, we need a better understanding of iron uptake, 
storage and utilization by plants. Furthermore, exposure to non-toxic Fe levels appears to affect 
inherent plant defense mechanisms. Consequently, exploring the effects of Fe exposure has 
potential for advances in plant disease resistance in addition to human nutrition. 

Microarray technology allows monitoring of gene expression levels for thousands of 
genes in a single experiment. This is achieved by simultaneously hybridizing two differentially 
labeled fluorescent FeNA pools to glass slides that contain spots of DNA (Schena et al. (1995) 
Science 270: 467-70). The Arabidopsis Functional Genomics Consortium (AFGC) has recently 
made public the results from such microarray experiments conducted with AFGC chips 
containing 10,000 non-redundant ESTs, selected from 37,000 randomly sequenced ESTs 
generated from mRNA of different tissues and developmental stages. 

The sequences of the ESTs showing at least two-fold increases or decreases over the 
controls were identified, compared to the Ceres full length FeNA and genomic sequence 
databanks, and identical Ceres clones identified. MA_diff and/or AFLP_diff MAdiff and/or 
AFLPdiff and/or AFLP_diff table reports the results of this analysis, indicating those Ceres 
clones that are up or down regulated over controls, thereby indicating the Ceres clones which are 
iron responsive genes. 

The MA_diff and/or AFLP_diff MA_diff and/or AFLP_diff and/or AFLP diff Table(s) 
reports the transcript levels of the experiment (see EXPT ID: Iron (relating to SMD 7114, SMD 
7115, SMD 7125)). For transcripts that had higher levels in the samples than the control, a 
is shown. A is shown for when transcript levels were reduced in root tips as compared to the 
control. For more experimental detail see the Example section below. 

Iron genes are those sequences that showed differential expression as compared to 
controls, namely those sequences identified in the MA diff and/or AFLP_diff MA_diff and/or 
AFLP_diff and/or AFLP diff tables with a "+" or indication 

Iron Genes Identified By Cluster Analyses Of Differential Expression 
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Iron Genes Identified By Correlation To Genes That Are Differentially 

Expressed 

As described above, the transcription profiles of genes that act together are well 
correlated. Applicants not only have identified the genes that are differentially expressed in the 
5 microarray experiments, but also have identified the genes that act in concert with them. The 
MA clust table indicates groups of genes that have well correlated transcription profiles and 
therefore participate in the same pathway or network. 

A pathway or network of Iron genes is any group in the MA clust that comprises 
a cDNA ID that also appears in Expt ID Iron (relating to SMD 7114, SMD 7115, SMD 7125) of 
10 the MA_diff and/or AFLPdiff MA_diff and/or AFLPdiff and/or AFLP diff table(s). 



Iron Genes Identified By Correlation To Genes That Cause Physiological 
!jj Consequences 

1j Additionally, the differential expression data and the phenotypic observations can 

i*5 be merged to identify pathways or networks of Iron genes. A group in the MA_clust is 

considered a Iron pathway or network if the group comprises a cDNA ID that also appears in 
ry Knock-in or Knock-out tables that causes one or more of the phenotypes described in section 
above. 



m 



20 Iron Genes Identified By Amino Acid Sequence Similarity 

Iron genes from other plant species typically encode polypeptides that share 
amino acid similarity to the sequences encoded by corn and Arabidopsis Iron genes. Groups of 
Iron genes are identified in the Protein Group table. In this table, any protein group that 
comprises a peptide ID that corresponds to a cDNA ID member of a Iron pathway or network is 

25 a group of proteins that also exhibits Iron functions/utilities. 



V.N.I. USE OF IRON RESPONSIVE GENES TO MODULATE PHENOTYPES 
Iron responsive genes and gene products are useful to or modulate one or more of 
30 the following phenotypes: 

Growth 
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• Roots 

• Root hair formation 
Stems 

Leaves 
Development 

• Senescence 
Plant nutrition 

Uptake and assimilation of organic compounds 
Uptake and assimilation of inorganic compounds 

• Animal (including human) nutrition 
Improved dietary mineral nutrition 

• Stress Response 
Metabolic detoxification 

• Heavy metals 

To improve any of the phenotype(s) above, activities of one or more of the iron 
responsive genes or gene products can be modulated and tested by screening for the 
desired trait. Specifically, the gene, mRNA levels, or protein levels can be altered in a 
plant utilizing the procedures described herein and the phenotypes can be assayed. As an 
example, a plant can be transformed according to Bechtold and Pelletier (1998, Methods. 
Mol. Biol. 82:259-266) and visually inspected for the desired phenotype or metabolically 
and/or functionally assayed according to Schmidt et al. (2000, Plant Physiol 122:1 109- 
18), Meagher (2000) Current Opinion in Plant Biology 3: 153-62), Deak (1999, Nature 
Biotechnology (1999, Nature Biotechnology 17: 192-96), Wei and Theil (2000, J. Biol 
Chem 275: 17488-93) and Vansuyt et al. (1997, FEBS Letters 410: 195-200). 

V.N.2. USE OF IRON-RESPONSIVE GENES, GENE COMPONENTS AND 

PRODUCTS TO MODULATE BIOCHEMICAL ACTIVITIES 
The activities of one or more of the iron responsive genes can be modulated to 
change biochemical or metabolic activities and/or pathways such as those noted below. 
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Such biological activities can be measured according to the citations included in the 
Table below: 




y 
m 
m 
m 
m 

m 
m 

3~ 




562 



PATENT 

Attorney Docket No. 2750-1481P 





PROCESS 


BIOCHEMICAL OR 


CITATIONS INCLUDING 






METABOLIC ACTIVITIES 


ASSAYS 






AND/OR PATHWAYS 






Growth , Differentiation 


Root Growth 


Robinson et al. (1999) 




and Development 


Initiation of root 


Nature 397: 694-97 






hairs 






Metabolisms 


Iron sensing 


Thomine et al. (2000) 








PNAS USA 97- 4991-6 






Iron uptake and transport 


Thomine et al. (2000) 


f% 
hQ 




-decreased iron 


PNAS USA 97:4991-6 


W 




transport 


Zhu(1999) Plant 


Ml 

Us 




-phytoremediation 


Physiol 119: 73-79 


I1J 


Plant Defenses 










Protection from oxidative 


Deak(1999) Nature 


CO 
111 




damage 


Biotechnology 17: 192- 
6 




Signaling 


Specific gene 


Brand and Perrimon 






transcription gene 


(1993) Development 






silencing 


118:401-415 



Other biological activities that can be modulated by the iron responsive genes and gene 
products are listed in the REFERENCE Table. Assays for detecting such biological activities are 
5 described in the Protein Domain table. 

Iron responsive genes are characteristically differentially transcribed in response to 
fluctuating iron levels or concentrations, whether internal or external to an organism or cell. 
MA_diff and/or AFLP_diff MA_diff and/or AFLP diff and/or AFLP_diff table reports the 
changes in transcript levels of various iron responsive genes. 
0 The microarray comparison consists of probes prepared from root RNA of A. thaliana 

(Columbia) seedlings grown under iron- sufficient conditions and seedlings grown under iron- 
deficient. The data from this experiment reveal a number of types genes and gene products. 
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Profiles of these different iron responsive genes are shown in the Table below with examples of 
associated biological activities. 



TRANSCRIPT 


TYPE OF GENES 


PHYSIOLOGICAL 


EXAMPLES OF 


LEVELS 




CONSEQUENCES 


BIOCHEMICAL 








ACTIVITY 


Up regulated 


responders to iron 


♦ Iron perception 


• Transporters 


transcripts 


application 


Iron uptake and 


• Metabolic enzymes 






transport 


• Change in cell 






Iron metabolism 


membrane structure 






Synthesis of 


and potential 






secondary 


• Kinases and 






metabolites 


phosphatases 






and/or proteins 


Transcription 








activators 






Modulation of 


• Change in chromatin 






iron response 


structure and/or 






transduction 


localized DNA 






pathways 


topology 






Specific gene 








transcription 








initiation 




Down-regulated 


responder to iron 


• Negative 


• Transcription factors 


transcripts 


repressors of iron state 


regulation of iron 


• Change in protein 




of metabolism 


pathways 


structure by 








phosphorylation 




Genes with 


Changes in 


(kinases) or 




discontinued 


pathways and 


dephosphoryaltion 




expression or 


processes 


(phosphatases) 
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TRANSCRIPT 
LEVELS 


TYPE OF GENES 


PHYSIOLOGICAL 
CONSEQUENCES 


EXAMPLES OF 
BIOCHEMICAL 
ACTIVITY 




unsTable mRNA in 
presence of iron 


operating in cells 
• Changes in other 
metabolisms than 
iron 


Change in chromatin 
structure and/or 
DNA topology 
Stability of factors 
for protein synthesis 
and degradation 
Metabolic enzymes 



Use of Promoters of Iron Responsive Genes 

Promoters of Iron responsive genes are useful for transcription of any desired 
polynucleotide or plant or non-plant origin. Further, any desired sequence can be transcribed in 
a similar temporal, tissue, or environmentally specific patterns as the Iron responsive genes 
where the desired sequence is operably linked to a promoter of a Iron responsive gene. The 
protein product of such a polynucleotide is usually synthesized in the same cells, in response to 
the same stimuli as the protein product of the gene from which the promoter was derived. Such 
promoter are also useful to produce antisense mRNAs to down-regulate the product of proteins, 
or to produce sense mRNAs to down-regulate mRNAs via sense suppression. 
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V.O. SHADE RESPONSIVE GENES, GENE COMPONENTS AND PRODUCTS 
Plants sense the ratio of Red (R) : Far Red (FR) light in their environment and respond 
differently to particular ratios. A low R:FR ratio, for example, enhances cell elongation and 
favors flowering over leaf production. The changes in R:FR ratios mimic and cause the shading 
response effects in plants. The response of a plant to shade in the canopy structures of 
agricultural crop fields influences crop yields significantly. Therefore manipulation of genes 
regulating the shade avoidance responses can improve crop yields. While phytochromes mediate 
the shade avoidance response, the down-stream factors participating in this pathway are largely 
unknown. One potential downstream participant, ATHB-2, is a member of the HD-Zip class of 
transcription factors and shows a strong and rapid response to changes in the R:FR ratio. ATHB- 
2 overexpressors have a thinner root mass, smaller and fewer leaves and longer hypocotyls and 
petioles. This elongation arises from longer epidermal and cortical cells, and a decrease in 
secondary vascular tissues, paralleling the changes observed in wild-type seedlings grown under 
conditions simulating canopy shade. On the other hand, plants with reduced ATHB-2 expression 
have a thick root mass and many larger leaves and shorter hypocotyls and petioles. Here, the 
changes in the hypocotyl result from shorter epidermal and cortical cells and increased 
proliferation of vascular tissue. Interestingly, application of auxin is able to reverse the root 
phenotypic consequences of high ATHB-2 levels, restoring the wild-type phenotype. 
Consequently, given that ATHB-2 is tightly regulated by phytochrome, these data suggest that 
ATHB-2 may link the auxin and phytochrome pathways in the shade avoidance response 
pathway. 

Changes in R:FR ratios promote changes in gene expression. Microarray technology 
allows monitoring of gene expression levels for thousands of genes in a single experiment. This 
is achieved by hybridizing labeled fluorescent cDNA pools to glass slides that contain spots of 
DNA (Schena et al. (1995) Science 270: 467-70). The US Arabidopsis Functional Genomics 
Consortium (AFGC) has recently made public the results from such microarray experiments 
conducted with AFGC chips containing about 10,000 non-redundant ESTs, selected from 
about3 7,000 randomly sequenced ESTs generated from mRNA of different tissues and 
developmental stages. 

The sequences of the ESTs showing at least two-fold increases or decreases in plants 
given 4 hours of FR rich light after growth in high R:FR light compared with the controls of 
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plants grown in high R:FR light only, were identified, compared to the Ceres full length cDNA 
and genomic sequence databanks, and equivalent Ceres clones identified. The MA_diff and/or 
AFLP_diff MA_diff and/or AFLPdiff and/or AFLP_diff table(s) report(s) the results of this 
analysis, indicating those Ceres clones which are up or down regulated over controls, thereby 
indicating the Ceres clones which are shade avoidance responsive genes. 

Examples of far red light induced, shade avoidance responsive genes and gene products 
are shown in the Reference and Sequence Tables. These genes and/or products are responsible 
for effects on traits such as plant vigor and seed yield. 

While far red light, shade avoidance responsive polynucleotides and gene products can 
act alone, combinations of these polynucleotides also affect growth and development. Useful 
combinations include different shade avoidance responsive polynucleotides and/or gene products 
that have similar transcription profiles or similar biological activities, and members of the same 
or similar biochemical pathways. In addition, the combination of a shade avoidance responsive 
polynucleotide and/or gene product with another environmentally responsive polynucleotides is 
also useful because of the interactions that exist between hormone regulated pathways, stress and 
pathogen induced pathways, nutritional pathways, light induced pathways and development. 
Here, in addition to polynucleotides having similar transcription profiles and/or biological 
activities, useful combinations include polynucleotides that may have different transcription 
profiles but which participate in common or overlapping pathways. 

Such far red light induced shade avoidance responsive genes and gene products 
can function to either increase or dampen the above phenotypes or activities either in response to 
changes in far red light or in the absence of far red light fluctuations. The MA_diff and/or 
AFLP_diff MA_diff and/or AFLP_diff and/or AFLP_diff Table(s) reports the transcript levels 
of the experiment (see EXPT ID: Shade (relating to SMD 8130, SMD 7230)). For transcripts 
that had higher levels in the samples than the control, a "+" is shown. A "-" is shown for when 
transcript levels were reduced in root tips as compared to the control. For more experimental 
detail see the Example section below. 

Shade genes are those sequences that showed differential expression as compared to 
controls, namely those sequences identified in the MA_diff and/or AFLP_diff MA_diff and/or 
AFLP diff and/or AFLP_diff tables with a or indication 
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Shade Genes Identified By Cluster Analyses Of Differential Expression 

Shade Genes Identified By Correlation To Genes That Are Differentially 

Expressed 

As described above, the transcription profiles of genes that act together are well 
correlated. Applicants not only have identified the genes that are differentially expressed in the 
microarray experiments, but also have identified the genes that act in concert with them. The 
MA_clust table indicates groups of genes that have well correlated transcription profiles and 
therefore participate in the same pathway or network. 

A pathway or network of Shade genes is any group in the MA clust that 
comprises a cDNA ID that also appears in Expt ID Shade (relating to SMD 8130, SMD 7230) of 
the MA_diff and/or AFLP_diff MA_diff and/or AFLP^diff and/or AFLP_diff table(s). 

Shade Genes Identified By Correlation To Genes That Cause Physiological 

Consequences 

Additionally, the differential expression data and the phenotypic observations can 
be merged to identify pathways or networks of Shade genes. A group in the MA clust is 
considered a Shade pathway or network if the group comprises a cDNA ID that also appears in 
Knock-in or Knock-out tables that causes one or more of the phenotypes described in section 
above. 

Shade Genes Identified By Amino Acid Sequence Similarity 

Shade genes from other plant species typically encode polypeptides that share 
amino acid similarity to the sequences encoded by corn and Arabidopsis Shade genes. Groups of 
Shade genes are identified in the Protein Group table. In this table, any protein group that 
comprises a peptide ID that corresponds to a cDNA ID member of a Shade pathway or network 
is a group of proteins that also exhibits Shade functions/utilities. 

Further, promoters of shade avoidance responsive genes, as described in the Reference 
tables, for example, are useful to modulate transcription that is induced by shade avoidance or 
any of the following phenotypes or biological activities below. Further, any desired sequence can 
be transcribed in similar temporal, tissue, or environmentally specific patterns as the shade 
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avoidance responsive genes when the desired sequence is operably linked to a promoter of a 
circadian (clock) responsive gene. 
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V.0.1. USE OF FAR RED RESPONSIVE, SHADE AVOIDANCE RESPONSE 

GENES TO MODULATE PHENOTYPES 
High FR:R, shade avoidance responsive genes and gene products can be used to 
alter or modulate one or more of the following phenotypes: 
Growth 
Roots 
Elongation 

Lateral root formation 
Stems 
Elongation 
Expansion 
Leaves 
Expansion 

Carotenoid composition 
Development 
Cell 
Growth 
Elongation 

Photosynthetic apparatus 
Efficiency 
Flower 

Flowering time 
Fruit 
Seed 

Dormancy 

Control rate and timing of germination 
Prolongs seed storage and viability 
Inhibition of hydro lytic enzyme synthesis 
Seed and Fruit yield 
Senescence 
Abscission 
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. Leaf fall 

• Flower longevity 
Differentiation 
Vascularization 

• Shade (Avoidance) Responses in plant architecture 

To regulate any of the phenotype(s) above, activities of one or more of the High FR: R light, 
shade avoidance responsive genes or gene products can be modulated and the plants tested by 
screening for the desired trait. Specifically, the gene, mRNA levels, or protein levels can be altered 
in a plant utilizing the procedures described herein and the phenotypes can be assayed. As an 
example, a plant can be transformed according to Bechtold and Pelletier (1998, Methods. Mol. 
Biol. 82:259-266) and/or screened for variants as in Winkler et al. (1998) Plant Physiol 118: 743-50 
and visually inspected for the desired phenotype or metabolically and/or functionally assayed 
according to Carabelli et al. (1996, PNAS USA 93: 3530-3535), Aguirrezabal and Tardieu (1996, J 
Exp Bot 47: 41 1-20), Heyer et al. (1995, Plant Physiol 109: 53-61), Garcia-Plazaola et al. (1997, J 
Exp Bot 48: 1667-74), Schwanz et al. (1996, J Exp Bot 47L 1941-50), Koehne et al. (1999, 
Biochem Biophys Acta 1412:94-107), Melis (1984, J Cell Biochem 24: 271-85), Steindeler et al. 
(1999, Development 126: 4235-45), Cruz (1997, J Exp Bot 48: 15-24), Stephanou and Manetas 
(1997, J Exp Bot 48: 1977-85), Grammatikopoulos et al (1999, J Exp Bot 50:517-21), Krause et al. 
(1999, Plant Physiol 121 : 1349-58), Aukerman et al. (1997, Plant Cell 9: 1317-26), Wagner et al. 
(1997, Plant Cell 9: 731-43), Weinig (2000) Evolution Int J Org Evolution 54: 124-26), Cocburn et 
al. (1996, J Exp Bot 47: 647-53), Devlin et al. (1999, Plant Physiol 1 19: 909-15), Devlin et al. 
(1998, Plant Cell 10: 1479-87), Finlayson et al. (1998, Plant Physiol 1 16: 17-25), Morelli and 
Ruberti (2000, Plant Physiol 122: 621-26), Aphalo et al. (1999, J Exp Bot 50: 1629-34), Sims et al. 
(1999, J Exp Bot 50: 50: 645-53) and Ballare (1999, Trends Plant Sci 4: 97-102). 
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V.O.2. USE OF FAR RED LIGHT, SHADE AVOIDANCE RESPONSIVE 
GENES TO MODULATE BIOCHEMICAL ACTIVITIES 

The activities of one or more of the far red light, shade avoidance responsive 
genes can be modulated to change biochemical or metabolic activities and/or pathways 
such as those noted below. Such biological activities can be measured according to the 
citations included in the Table below: 
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BIOCHEMICAL OR 






PROCESS 


METABOLIC 


CITATIONS INCLUDING 






ACTIVITIES AND/OR 


ASSAYS 






PATHWAYS 






Cell Growth and 


Cell Elongation 


Carabelli et al. (1996) PNAS USA 




Differentiation 




93: 3530-35 






Leaf Expansion 


Heyer et al. (1995) Plant Physiol 








109: 53-61 




Photosynthesis 


Development of 


Jagtap et al. (1998) J Exp Bot 49: 


:, i?"; 




Photosynthetic Apparatus 


1715-21 








Melis (1984) J Cell Biochem 24: 








271-285 


11J 

: :J ; 






McCain (1995) Biophys J 69: 1 105- 








10 






Carotenoid Composition 


Garcia-Plazaola et al (1997) J Exp 


I1J 

4» 






Bot 48: 1667-74 




Carbon/Nitrogen 


Carbon and Nitrogen 


Cruz (1997) J Exp Bot 48: 15-24 




Metabolism 


Assimilation 






Far red light, shade 




Newton AL ? Sharpe BK, Kwan A, 




avoidance response 




Mackay JP, Crossley M. J Biol 




binding by transcription 




Chem. 2000Mayl9;275(20):15128- 




factors 




34; Lopez Ribera I ? Ruiz-Avila L, 








Puigdomenech P. Biochem Biophys 








Res Commun. 1997 Jul 








18;236(2):510-6; de Pater S, Greco 








V ? Pham K, Memelink J, Kijne J. 








Nucleic Acids Res. 1996 Dec 1; 








24(23):4624-31. 
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PROCESS 


BIOCHEMICAL OR 
METABOLIC 
ACTIVITIES AND/OR 
PATHWAYS 


CITATIONS INCLUDING 
ASSAYS 


Signaling 


UV Light Perception 


Stephanou and Manetas (1997) J 
ExpBot48: 1977-85 




Far-red/Red Light 
Perception 


Aukerman et al. (1997) Plant Cell 
9: 1317-26 

Wagner et al. (1997) Plant Cell 9: 
731-43 




Interaction of "Shade 
Factor" with Ethylene 
Production/Transduction 


Finlayson et al. (1998) Plant 
Physiol 116: 17-25 




Interaction of "Shade 
Factor" with Auxin 
Production/Transduction 


Reed et al. (1998) Plant Physiol 
118: 1369-78 




Plant to Plant signalling 


Sims et al. (1999) J Exp Bot 50: 
645-53 



Other biological activities that can be modulated by shade avoidance response genes and 
their products are listed in the REF TABLES. Assays for detecting such biological activities are 
described in the Protein Domain table. 
5 High FR:R, shade avoidance responsive genes are differentially transcribed in response to 

high FR:R ratios. The microarray comparison to reveal such genes consisted of probes prepared 
from RNA isolated from the aerial tissues of A. thaliana (Columbia) two-week old seedlings 
grown in high R:FR ratios compared to seedlings grown in high R:FR ratios followed by 4 hours 
of FR-rich light treatment. The data from this experiment reveal a number of types genes and 
10 gene products and examples of the classes of genes are given in the Table below. 
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TRANSCRIPT 


TYPE OF GENES 


PHYSIOLOGICAL 


EXAMPLES OF 


LEVELS 




CONSEQUENCES 


BIOCHEMICAL 








ACTIVITY 


Up regulated 


Responders to high 


• Far red light 


• Transporters 


transcripts 


FR:R light ratios 


perception 


♦ Metabolic enzymes 






Metabolism 


Change in cell 






affected by far red 


membrane structure 




Genes induced by 


light 


and potential 




high FR:R light ratio 


• Synthesis of 


• Kinases and 






secondary 


nho snh atase s 






metabolites and/or 


• Transcription 






proteins 


activators 








• Change in chromatin 






Modulation of 


structure and/or 






high FR:R light 


localized DNA 






ratio transduction 


topology 






pathways 


• Leaf production 






• Specific gene 


factors 






transcription 








initiation 




Down-regulated 


Responders to high 


Changes in 


• Transcription factors 


transcripts 


FR:R light ratios 


pathways and 


Change in protein 






processes 


structure by 




Genes repressed by 


operating in cells 


phosphorylation 




high FR:R light ratio 


Changes in 


(kinases) or 






metabolisms other 


dephosphorylation 




Genes with 


than far red 


(phosphatases) 




discontinued 


stimulated 


Change in chromatin 




expression or 


pathways 


structure and/or DNA 
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TRANSCRIPT 


TYPE OF GENES 


PHYSIOLOGICAL 


EXAMPLES OF 




LEVELS 




CONSEQUENCES 


BIOCHEMICAL 
ACTIVITY 






unsTable mRNA 




topology 






during high FR:R 




♦ Stability of factors for 






ratio light 




protein synthesis and 

degradation 
♦ Metabolic enzymes 
. Cell elongation factors 


i~ 

13 








Flowering promotion 
factors 



m 
m 
m 
m 

m 
w 5 



10 



Use of Promoters of Shade Avoidance Genes 

Promoters of Shade Avoidance genes are useful for transcription of any desired 
polynucleotide or plant or non-plant origin. Further, any desired sequence can be transcribed in 
a similar temporal, tissue, or environmentally specific patterns as the Shade Avoidance genes 
where the desired sequence is operably linked to a promoter of a Shade Avoidance gene. The 
protein product of such a polynucleotide is usually synthesized in the same cells, in response to 
the same stimuli as the protein product of the gene from which the promoter was derived. Such 
promoter are also useful to produce antisense mRNAs to down-regulate the product of proteins, 
or to produce sense mRNAs to down-regulate mRNAs via sense suppression. 
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V.P. SULFUR RESPONSIVE GENES, GENE COMPONENTS AND PRODUCTS 
Sulfur is one of the important macronutrients required by plants. It is taken up from the 
soil solution by roots as in the form of sulfate anion which higher plants are dependent on to 
fulfill their nutritional sulfur requirement. After uptake from the soil, sulfate is either 
accumulated and stored in vacuole or it is assimilated into various organic compounds, e.g. 
cysteine, glutathione, methionine, etc. Thus, plants also serve as nutritional sulfur sources for 
animals. Sulfur can be assimilated in one of two ways: it is either incorporated as sulfate in a 
reaction called sulfation, or it is first reduced to sulfide, the substrate for cysteine synthesis. In 
plants, majority of sulfur is assimilated in reduced form. 

Sulfur comprises a small by vital fraction of the atoms in many protein molecules. As 
disulfide bridges, the sulfur atoms aid in stabilizing the folded proteins, such cysteine residues. 
Cys is the first sulfur-containing amino acids, which in proteins form disulfide bonds that may 
affect the tertiary structures and enzyme activities. This redox balance is mediated by the 
disulfide/thiol interchange of thioredoxin or glutaredoxin using NADPH as an electron donor. 
Sulfur can also become sulfhydryl (SH) groups participating in the active sites of some enzymes 
and some enzymes require the aid of small molecules that contain sulfur. In addition, the 
machinery of photosynthesis includes some sulfur-containing compounds, such as ferrodoxin.. 
Thus, sulfate assimilation plays important roles not only in the sulfur nutrition but also in the 
ubiquitous process that may regulate the biochemical reactions of various metabolic pathways. 

Deficiency of sulfur leads to a marked chlorosis in younger leaves, which may become 
white in color. Other symptoms of sulfur deficiency also include weak stems and reduced 
growth. Adding sulfur fertilizer to plants can increase root development and a deeper green 
color of the leaves in sulfur-deficient plants. However, Sulfur is generally sufficient in soils for 
two reasons: it is a contaminant in potassium and other fertilizers and a product of industrial 
combustion. Sulfur limitation in plants is thus likely due to the limitation of the uptake and 
distribution of sulfate in plants. Seven cell type specific sulfate transporter genes have been 
isolated from Arabidopsis. In sulfate-starved plants, expression of the high-affinity transporter, 
AtSTl-1, is induced in root epidermis and cortex for acquisition of sulfur. The low affinity 
transporter, AtST2-l (AST68), accumulates in the root vascular tissue by sulfate starvation for 
root-to-shoot transport of sulfate. These studies have shown that the whole-plant process of 
sulfate transport is coordinately regulated by the expression of these 2 sulfate transporter genes 
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under sulfur limited conditions. Recent studies have proposed that feeding of O-acetylserine, 
GSH and selenate may regulate the expression of AtSTl-1 and AtST2-l (AST68) in roots either 
positively or negatively. However, regulatory proteins that may directly control the expression of 
these genes have not been identified yet. 

It has been established that there are regulatory interactions between assimilatory sulfate 
and nitrate reduction in plants. The two assimilatory pathways are very similar and well 
coordinated; deficiency for one element was shown to repress the other pathway. The 
coordination between them should be taken into consideration when one tries to alter one of 
pathways. 

Microarray technology allows monitoring of gene expression levels for thousands of 
genes in a single experiment. This is achieved by simultaneously hybridizing two differentially 
labeled fluorescent cDNA pools to glass slides that contain spots of DNA (Schena et al. (1995) 
Science 270: 467-70). The Arabidopsis Functional Genomics Consortium (AFGC) has recently 
made public the results from such microarray experiments conducted with AFGC chips 
containing 10,000 non-redundant ESTs, selected from 37,000 randomly sequenced ESTs 
generated from mRNA of different tissues and developmental stages. 

The sequences of the ESTs showing at least two-fold increases or decreases over the 
controls were identified, compared to the Ceres full-length cDNA and genomic sequence 
databanks, and identical Ceres clones identified. MA diff and/or AFLP_diff MA_diff and/or 
AFLPdiff and/or AFLP_diff table reports the results of this analysis, indicating those Ceres 
clones which are up or down regulated over controls, thereby indicating the Ceres clones which 
are sulfur response responsive genes. 

The MA_diff and/or AFLP_diff MA_diff and/or AFLP diff and/or AFLP_diff Table(s) 
reports the transcript levels of the experiment (see EXPT ID: Sulfur (relating to SMD 8034, 
SMD 8035)). For transcripts that had higher levels in the samples than the control, a "+" is 
shown. A "-" is shown for when transcript levels were reduced in root tips as compared to the 
control. For more experimental detail see the Example section below. 

Sulfur genes are those sequences that showed differential expression as compared to 
controls, namely those sequences identified in the MA_diff and/or AFLP_diff MA_diff and/or 
AFLP_diff and/or AFLP_diff tables with a "+" or indication 
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Sulfur Genes Identified By Cluster Analyses Of Differential Expression 

Sulfur Genes Identified By Correlation To Genes That Are Differentially 

Expressed 

As described above, the transcription profiles of genes that act together are well 
correlated. Applicants not only have identified the genes that are differentially expressed in the 
microarray experiments, but also have identified the genes that act in concert with them. The 
MA_clust table indicates groups of genes that have well correlated transcription profiles and 
therefore participate in the same pathway or network. 

A pathway or network of Sulfur genes is any group in the MA_clust that 
comprises a cDNA ID that also appears in Expt ID Sulfur (relating to SMD 8034, SMD 8035) of 
the MA_diff and/or AFLP_diff MA_diff and/or AFLP_diff and/or AFLPdiff table(s). 

Sulfur Genes Identified By Correlation To Genes That Cause Physiological 

Consequences 

Additionally, the differential expression data and the phenotypic observations can 
be merged to identify pathways or networks of Sulfur genes. A group in the MA clust is 
considered a Sulfur pathway or network if the group comprises a cDNA ID that also appears in 
Knock-in or Knock-out tables that causes one or more of the phenotypes described in section 
above. 

Sulfur Genes Identified By Amino Acid Sequence Similarity 

Sulfur genes from other plant species typically encode polypeptides that share 
amino acid similarity to the sequences encoded by corn and Arabidopsis Sulfur genes. Groups of 
Sulfur genes are identified in the Protein Group table. In this table, any protein group that 
comprises a peptide ID that corresponds to a cDNA ID member of a Sulfur pathway or network 
is a group of proteins that also exhibits Sulfur functions/utilities. 

V.P.I. USE OF SULFUR RESPONSIVE GENES TO MODULATE 
PHENOTYPES 
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Sulfur responsive genes and gene products are useful to or modulate one or more 
of the following phenotypes: 
Growth 
Roots 

• Stems 

• Leaves 
Development 

Chloroplasts and Mitochondria 
Fruit Development 
Seed Development 

• Seed storage proteins 
Senescence 

• Differentiation 

Plastid/chloroplast and mitochondria differentiation 
Protection against oxidative damage 

• regulation of enzymes via redox control by thiol groups 

• Metabolic detoxification 
Photosynthesis 

• Carbon dioxide fixation 

To improve any of the phenotype(s) above, activities of one or more of the sulfur 
responsive genes or gene products can be modulated and tested by screening for the 
desired trait. Specifically, the gene, mRNA levels, or protein levels can be altered in a 
plant utilizing the procedures described herein and the phenotypes can be assayed. As an 
example, a plant can be transformed according to Bechtold and Pelletier (1998, Methods. 
Mol. Biol. 82:259-266) and visually inspected for the desired phenotype or metabolically 
and/or functionally assayed according to Saito et al. (1994, Plant Physiol. 106: 887-95), 
Takahashi et al (1997, Proc. Natl. Acad. Sci. USA 94: 1 1 102-07) and Koprivova et al. 
(2000, Plant Physiol. 122: 737-46). 
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V.P.2. USE OF SULFUR-RESPONSIVE GENES, GENE COMPONENTS AND 

PRODUCTS TO MODULATE BIOCHEMICAL ACTIVITIES 
The activities of one or more of the sulfur responsive genes can be modulated to 
change biochemical or metabolic activities and/or pathways such as those noted below. 
Such biological activities can be measured according to the citations included in the 
Table below: 
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PROCESS 


BIOCHEMICAL OR 


CITATIONS INCLUDING 




METABOLIC ACTIVITIES 


ASSAYS 




AND/OR PATHWAYS 




Growth , 


• Root 


Klein and Klein (1988) Mineral 


Differentiation and 


Leaf 


Nutrition, In CM Wilson and J 


Development 


Stem 


Gregory, eds Fundamentals of 




Chloroplast/Mitochondria 


Plant Science. Harper and Row 




development/differentiation 


Publishers, Inc., NY, pi 63 






Rost et al. (1984) The Absorption 






and Transport System, In R Bern, 






ed, Botany-A Brief Introduction to 






Plant Biology. John Wiley and 






Sons, NY, p96. 






Huluigue et al. (2000) Biochem 






Biophys Res Commun 271 : 380-5 






Kapazoglou et al. (2000) Eur J 






Biochem 267: 352-60 




Seed storage protein 


Kim et al. (1999) 209: 282-9 




synthesis 




Metabolisms 


• Sulfate uptake and transport 


Takahashi et al. (1997) Proc Natl 






Acad Sci USA 94: 1 1 102-07 




Cysteine Biosynthesis 


Saito et al. (1992) Proc Natl Acad 






Sci USA 89: 8078-82 






Hesse et al. (1999) Amino Acids 






16: 113-31 




• Methionine biosynthesis 


Bourgis et al. (1999) Plant Cell 1 1 : 






1485-98 




Carbon dioxide fixation in 


Buchana (1991) Arch Biochem 




photosynthesis 


Biophys 288: 1-9 
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PROCESS 


BIOCHEMICAL OR 


CITATIONS INCLUDING 




METABOLIC ACTIVITIES 


ASSAYS 




AND/OR PATHWAYS 






Thioredoxin reduction 


Leustek and Saito (1999) Plant 






Phyiol 120: 637-43 






Mamedovaet al. (1999) FEBS 






Lett 462:421-4 




Nitrogen metabolism 


Koprivova et al. (2000) Plant 






Physiol. 122: 737-46 






Yamaguchi et al. (1999) Biosci 






Biotechnol Biochem 63: 762-6 


Plant Defenses 


• Reduction of oxidative 


Mav et al. (1998) J Exnt Bio 49* 




stress — oxygen metabolism 


649-67 




and reactive oxygen species 


Kreuz et al. (1996) Plant Physiol 




Detoxification of toxins, 


111: 349-53 




xenobiotics and heavy 


Zhao et al. (1998) Plant Cell 10: 




metals 


359-70 




• Defense against pathogens 


Kvune and Fleming H997") J Food 




or microbes 


Prot60: 67-71 




Disease prevention by 


Fahey et al. (1997) Proc Natl Acad 




secondary sulfur-containing 


Sci USA 94: 10367-72 




compounds 






Activation of kinases and 


Davis et al. (1999) Plant Cell 1 1 : 




phosphatases 


1179-90 



Other biological activities that can be modulated by the sulfur responsive genes and gene 
products are listed in the REFERENCE Table. Assays for detecting such biological activities are 
described in the Protein Domain table. 

Sulfur responsive genes are characteristically differentially transcribed in response to 
fluctuating sulfur levels or concentrations, whether internal or external to an organism or cell. 
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MAjiiff and/or AFLP diff MA diff and/or AFLP_diff and/or AFLP_diff table reports the 
changes in transcript levels of various sulfur responsive genes. 

Profiles of these different sulfur responsive genes are shown in the Table below 
with examples of associated biological activities. 



TRANSCRIPT 


TYPE OF GENES 


PHYSIOLOGICAL 


EXAMPLES OF 


LEVELS 




CONSEQUENCES 


BIOCHEMICAL 








ACTIVITY 


Up regulated 


Responders to sulfur 


• Sulfur perception 


• TransDorters 


transcripts 


Application 


• Sulfur uptake and 


• Metabolic enzvmes 






transport 


• Change in cell 






• Sulfur metabolism 


membrane ^tnicturp 






Synthesis of 


and potential 






secondary 


• Kinases and 






metabolites and/or 


phosphatases 






proteins 


Transcription 








activators 






Modulation of 


• Change in chromatin 






sulfur response 


structure and/or 






transduction 


localized DNA 






pathways 


topology 






Specific gene 


Redox control 






transcription 








initiation 




Down-regulated 


responder to sulfur 


♦ Negative 


Transcription factors 


transcripts 


repressors of sulfur 


regulation of 


Change in protein 




state of metabolism 


sulfur pathways 


structure by 








phosphorylation 




Genes with 


Changes in 


(kinases) or 




discontinued 


pathways and 


dephosphoryaltion 




expression or 


processes 


(phosphatases) 
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TRANSCRIPT 
LEVELS 


TYPE OF GENES 


PHYSIOLOGICAL 
CONSEQUENCES 


EXAMPLES OF 
BIOCHEMICAL 
ACTIVITY 






unsTable mRNA in 


operating in cells 


Change in chromatin 






presence of sulfur 


♦ Changes in other 


structure and/or DNA 








metabolisms than 


topology 








sulfur 


• Stability of factors for 










protein synthesis and 










degradation 










Metabolic enzymes 



§p Use of Promoters of Sulfur Responsive Genes 

Promoters of Sulfur responsive genes are useful for transcription of any desired 

*": J I 

polynucleotide or plant or non-plant origin. Further, any desired sequence can be transcribed in 
■^5 a similar temporal, tissue, or environmentally specific patterns as the Sulfur responsive genes 
rti where the desired sequence is operably linked to a promoter of a Sulfur responsive gene. The 
;p protein product of such a polynucleotide is usually synthesized in the same cells, in response to 
^ the same stimuli as the protein product of the gene from which the promoter was derived. Such 
promoter are also useful to produce antisense mRNAs to down-regulate the product of proteins, 
10 or to produce sense mRNAs to down-regulate mRNAs via sense suppression. 
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V.Q. ZINC RESPONSIVE GENES, GENE COMPONENTS AND PRODUCTS 
Phytoremediation of soils contaminated with toxic levels of heavy metals requires the 
understanding of plant metal transport and tolerance. The numerous Arabidopsis thaliana studies 
have given scientists the potential for dissection and elucidation of plant micronutrient/heavy 
5 metal uptake and accumulation pathways. It has been shown altered regulation of ZNT1 , a 
Zn/Cd transporter, contributes to high Zn uptake. Isolation and characterization of Zn/Cd 
hyperaccumulation genes may allow expression in higher biomass plant species for efficient 
contaminated soil clean up. Identification of additional Zn transport, tolerance and nutrition- 
related genes involved in heavy metal accumulation will enable manipulation of increased uptake 
10 (for phytoremediation) as well as limitation of uptake or leak pathways that contribute to toxicity 
; p in crop plants. Additionally, Zn-binding ligands involved in Zn homeostasis or tolerance may be 

identified, as Ayell as factors affecting the activity or expression of Zn binding transcription 
^1 factors. Gene products acting in concert to effect Zn uptake, which would not have been 
|y identified in complementation experiments, including multimeric transporter proteins, could also 
be identified. 

O Microarray technology allows monitoring of gene expression levels for thousands of 

llj genes in a single experiment. This is achieved by simultaneously hybridizing two differentially 
X labeled fluorescent cDNA pools to glass slides that contain spots of DNA (Schena et al. (1995) 

Science 270: 467-70). The Arabidopsis Functional Genomics Consortium (AFGC) has recently 
20 made public the results from such microarray experiments conducted with AFGC chips 

containing 10,000 non-redundant ESTs, selected from 37,000 randomly sequenced ESTs 

generated from mRNA of different tissues and developmental stages. 

The sequences of the ESTs showing at least two-fold increases or decreases over the 

controls were identified, compared to the Ceres full-length cDNA and genomic sequence 
25 databanks, and identical Ceres clones identified. The Zn response information was then used in 

conjunction with the existing annotation to attribute biological function or utility to the full- 
length cDNA and corresponding genomic sequence. 

The MA_diff and/or AFLP_diff MA_diff and/or AFLP_diff and/or AFLP_diff Table(s) 

reports the transcript levels of the experiment (see EXPT ID: Zinc (relating to SMD 7310, SMD 
30 73 1 1)). For transcripts that had higher levels in the samples than the control, a "+" is shown. A 



586 



PATENT 

Attorney Docket No. 2750-1481P 
"-" is shown for when transcript levels were reduced in root tips as compared to the control. For 
more experimental detail see the Example section below. 

Zinc genes are those sequences that showed differential expression as compared to 
controls, namely those sequences identified in the MAdiff and/or AFLP_diff MA_diff and/or 
AFLP_diff and/or AFLP diff tables with a "+" or "-" indication 

Zinc Genes Identified By Cluster Analyses Of Differential Expression 

Zinc Genes Identified By Correlation To Genes That Are Differentially Expressed 
As described above, the transcription profiles of genes that act together are well 
correlated. Applicants not only have identified the genes that are differentially expressed in the 
microarray experiments, but also have identified the genes that act in concert with them. The 
MA clust table indicates groups of genes that have well correlated transcription profiles and 
therefore participate in the same pathway or network. 

A pathway or network of Zinc genes is any group in the MA clust that comprises 
a cDNA ID that also appears in Expt ID Zinc (relating to SMD 7310, SMD 731 1) of the 
MA diff and/or AFLP_diff MA_diff and/or AFLP_diff and/or AFLP diff table(s). 

Zinc Genes Identified By Correlation To Genes That Cause Physiological 

Consequences 

Additionally, the differential expression data and the phenotypic observations can 
be merged to identify pathways or networks of Zinc genes. A group in the MA clust is 
considered a Zinc pathway or network if the group comprises a cDNA ID that also appears in 
Knock-in or Knock-out tables that causes one or more of the phenotypes described in section 
above. 

Zinc Genes Identified By Amino Acid Sequence Similarity 

Zinc genes from other plant species typically encode polypeptides that share 
amino acid similarity to the sequences encoded by corn and Arabidopsis Zinc genes. Groups of 
Zinc genes are identified in the Protein Group table. In this table, any protein group that 
comprises a peptide ID that corresponds to a cDNA ID member of a Zinc pathway or network is 
a group of proteins that also exhibits Zinc functions/utilities. 
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V.Q.I. USE OF ZN TRANSPORT, TOLERANCE AND NUTRITION- 
RELATED GENES TO MODULATE PHENOTYPES 
Changes in zinc concentration in the surrounding environment or in contact with a plant 
results in modulation of many genes and gene products. Examples of such zinc responsive genes 
and gene products are shown in the Reference, Sequence tables, Protein Group, Protein Group 
Matrix, MA_diff and/or AFLP_diff MA_diff and/or AFLP_diff and/or AFLP__diff , and MA_clust 
tables. These genes and/or products are responsible for effects on traits such as plant vigor and seed 
yield. 

While zinc responsive polynucleotides and gene products can act alone, combinations of 
these polynucleotides also affect growth and development. Useful combinations include 
different zinc responsive polynucleotides and/or gene products that have similar transcription 
profiles or similar biological activities, and members of the same or similar biochemical 
pathways. In addition, the combination of a zinc responsive polynucleotide and/or gene product 
with another environmentally responsive polynucleotide is also useful because of the interactions 
that exist between hormone regulated pathways, stress pathways, nutritional pathways and 
development. Here, in addition to polynucleotides having similar transcription profiles and/or 
biological activities, useful combinations include polynucleotides that may have different 
transcription profiles but which participate in a common pathway. 

Such zinc responsive genes and gene products can function to either increase or 

dampen the above phenotypes or activities either 

• in response to changes in zinc concentration or 

• in the absence of zinc fluctuations. 

Zn transport, tolerance and nutrition-related genes and gene products can be used to 
alter or modulate one or more of the following phenotypes: 
Zn Uptake 

Transport of Zn or other heavy metals into roots 
Epidermal/cortical uptake 
Xylem loading 

• Zn compartmentation 
Xylem unloading 
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♦ Phloem loading 

• Efflux from cells to apoplast 

Sequestration in vacuoles/subcellular compartments. 

• Zn tolerance 
Chelation of Zn 

♦ Transport of Zn 

Metabolic and transcriptional control 

Activity of Zn binding enzymes 

Activity of Zn binding transcription factors 

To improve any of the phenotype(s) above, activities of one or more of the Zn transport, 
tolerance and nutrition-related genes or gene products can be modulated and the plants can be tested 
by screening for the desired trait. Specifically, the gene, mRNA levels, or protein levels can be 
altered in a plant utilizing the procedures described herein and the phenotypes can be assayed, for 
example, in accordance to Lasat MM, Pence NS, Garvin DF, Ebbs SD, Kochian LV. J Exp Bot. 
2000 Jan;5 1 (342):7 1 -9; Grotz N, Fox T, Connolly E, Park W, Guerinot ML, Eide D. Proc Natl 
Acad Sci USA. 1998 Jun 9;95(12):7220-4; Crowder MW, Maiti MK, Banovic L, Makaroff CA. 
FEBS Lett. 1997 Dec l;418(3):351-4; Hart JJ, Norvell WA, Welch RM, Sullivan LA, Kochian LV. 
Plant Physiol. 1998 Sep;l 18(l):219-26. 

V.Q.2. USE OF ZN TRANSPORT, TOLERANCE AND NUTRITION- 
RELATED GENES TO MODULATE BIOCHEMICAL ACTIVITIES 

Alternatively, the activities of one or more of the zinc responsive genes can be 
modulated to change biochemical or metabolic activities and/or pathways such as those 
noted below. Such biological activities can be measured according to the citations 
included in the Table below: 



PROCESS 


BIOCHEMICAL OR 


CITATIONS INCLUDING 




METABOLIC 


ASSAYS 




ACTIVITIES AND/OR 






PATHWAYS 
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PROCESS 


BIOCHEMICAL OR 
METABOLIC 
ACTIVITIES AND/OR 
PATHWAYS 


CITATIONS INCLUDING 
ASSAYS 


Zn Uptake and 
Assimilation 


Zn Influx 


Lasat MM, Pence NS, Garvin DF, 
Ebbs SD, Kochian LV. J Exp Bot. 
2000 Jan;51(342):71-9. 




Zn compartmentation 


Hart JJ, Norvell WA, Welch RM, 
Sullivan LA, Kochian LV. Plant 
Physiol. 1998 Sep;l 18(l):219-26. 


Zn binding by metabolic 
enzymes 




Crowder MW, Maiti MK, Banovic 
L, Makaroff CA.FEBS Lett. 1997 
Dec l;418(3):351-4; Kenzior AL, 
Folk WR. FEBS Lett. 1998 Dec 
4;440(3):425-9. 


Zn binding by 
transcription factors 




Newton AL, Sharpe BK, Kwan A, 
Mackay JP, Crossley M. J Biol 
Chem. 2000Mayl9;275(20):15128- 
34; Lopez Ribera I, Ruiz-Avila L, 
Puigdomenech P. Biochem Biophys 
Res Commun. 1997 Jul 
18;236(2):510-6; de Pater S, Greco 
V, Pham K, Memelink J, Kijne J. 
Nucleic Acids Res. 1996 Dec 
1;24(23):4624-31. 
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PROCESS 


BIOCHEMICAL OR 
METABOLIC 
ACTIVITIES AND/OR 
PATHWAYS 


CITATIONS INCLUDING 
ASSAYS 


Synthesis of proteins to 
chelate Zn and other 
metals 




Schafer HJ, Greiner S, Rausch T, 
Haag-Kerwer A. FEBS Lett. 1997 
Mar 10;404(2-3):2 16-20. 

Rauser WE.Cell Biochem Biophys. 
1999;31(l):19-48. 


Synthesis of metabolites 
to chelate Zn and other 
metals 




Rauser WE.Cell Biochem Biophys. 
1999;31(l):19-48. 



Other biological activities that can be modulated by Zn transport, tolerance and nutrition- 
related genes and their products are listed in the Reference tables. Assays for detecting such 
biological activities are described in the Protein Domain table. 

Zn transport, tolerance and nutrition-related genes are differentially transcribed in 
response to low Zn concentrations. The microarray comparison consists of probes prepared from 
root RNA of A. thaliana (Columbia) seedlings hydroponically grown in complete nutrient 
medium (control) and Zn deficient seedlings grown in -Zn nutrient medium (experimental). The 
data from this experiment reveal a number of types genes and gene products. MA_diff and/or 
AFLP_diff table reports the changes in transcript levels of various zinc responsive genes in 
entire seedlings at 1 and 6 hours after a plant was sprayed with a Hoagland's solution enriched 
with zinc as compared to seedlings sprayed with Hoagland's solution only. 

The data from this time course can be used to identify a number of types of zinc 
responsive genes and gene products, including "early responding," "high zinc responders," 
"repressors of zinc deprivation pathways" and "zinc deprivation responders." Profiles of these 
different zinc responsive genes are shown in the Table below with examples of associated 
biological activities. 
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TRANSCRIPT 


TYPE OF GENE 


PHYSIOLOGICAL 


EXAMPLES OF 


LEVELS 




CONSequence 


BIOCHEMICAL 








ACTIVITY 


Upregulated 


Early responders to 


- Zinc Perception 


-Transcription 


transcripts (level at 


Zinc 


-Zinc Uptake 


Factors 


1 hour = 6 hours) 




- Modulation of Zinc 


-Transporters 


(level at 1 hour > 6 




Response Transduction 




hours) 




Pathways 








- Specific Gene 








Transcription Initiation 






Zinc Deprivation 




-Inhibit Transport of 




Responders 


-Repression of Pathways 


Zinc 






to Optimize Zinc 


-Degradation 






Response Pathways 




Level at 1 hour < 6 


Delayed Zinc 




-Zinc Metabolic 


hours 


Responders 




Pathways 




Repressor of Zinc 


Negative Regulation of 






Deprivation Pathways 


Zinc Pathways 




Down Regulated 


Early responder 


Negative Regulators of 


Suppressing Zinc 


transcripts (Level 


repressors of Zinc 


Zinc Utilization Pathways 


Requiring processes 


at 1 hour = 6 


utilization Pathways 






hours) 








(Level at 6 hours > 








1 hour) 








Level at 1 hour > 6 


Genes with 


Changes in pathways and 




hours 


discontinued 


processes operating I cells 






expression or 








unsTable mRNA 








following Zinc uptake 
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Use of Promoters of Zinc Responsive Genes 

Promoters of Zinc responsive genes are useful for transcription of any desired 
polynucleotide or plant or non-plant origin. Further, any desired sequence can be transcribed in 
5 a similar temporal, tissue, or environmentally specific patterns as the Zinc responsive genes 
where the desired sequence is operably linked to a promoter of a Zinc responsive gene. The 
protein product of such a polynucleotide is usually synthesized in the same cells, in response to 
the same stimuli as the protein product of the gene from which the promoter was derived. Such 
promoter are also useful to produce antisense mRNAs to down-regulate the product of proteins, 
10 or to produce sense mRNAs to down-regulate mRNAs via sense suppression. 



m 
m 
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VI. UTILITIES OF PARTICULAR INTEREST 

Genes capable of modulating the phenotypes in the following table are useful produce the 
associated utilities in the table. Such genes can be identified by their cDNA ID number in the 
Knock-in and Knock-out tables. 



Phenotype Utilities 

Modulated by a 

Gene 



Leaf shape 



Cordate 

Cup-shaped 

Curled 

Laceolate 

Lobed 

Oval 

Ovate 

Serrate 

Trident 
Undulate 

Vertically Oblong 

Other Shapes 



Long petioles 
Short petioles 



decrease wind opacity, 

decrease lodging (plant fall over), 

increase biomass by making larger or 

different shaped leaves, 

improve the efficiency of mechanical 

harvesting, 

decrease transpiration for better drought 
tolerance, 

changing leaf shape to collect and absorb 
water, 

modulation of canopy structure and shading 
for altered irradiance close to the ground, 

enhanced uptake of pesticides (herbicides, 
fungicides, etc), 

creation of ornamental leaf shapes, 
increase resistance to pathogens by 
decreasing amount of water that collects on 
leaves, 

change proporation of cell types in the 
leaves for enhanced photosynthesis, 
decreased transpiration, and enhanced 

accumulation of desirable compounds 
including secondary metabolites in 
specialized cells, 
decrease insect feeding, 

decrease wind opacity, 

decrease lodging (plant fall over), 

increase biomass by better positioning of 

the leaf blade, 

decrease insect feeding, 

decrease transpiration for better drought 

tolerance, 

position leaves most effectively for 
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Phenotype Utilities 

Modulated by a 

Gene 

otosynthetic efficiency 



Fused 

Reduced fertility 

Short siliques 

Reduced fertility 
Sterility 

Flower size 

Stature Large 
Small 

Dwarfs 

Meristems 



ornamental applications to make distinctive 
plants, 



increase or decrease the number of seeds in 
a fruit, 

increasing fruit size, 

modulating fruit shape to better fit 

harvesting or packaging requirements, 

useful for controlling dehisence and seed 

scatter 

useful in hybrid breeding programs, 
increasing fruit size, 
production of seedless fruit, 
useful as targets for gametocides, 
modulating fruit shape to better fit 
harvesting or packaging requirements, 
useful for controlling dehisence and seed 
scatter 

useful for edible flowers 

useful for flower derived products such as 

fragrances 

useful for modulating seed size and number 
in combination with seed-specific genes 

value in the ornamental industry 

increasing or decreasing plant biomass, 
optimizing plant stature to increase yield 
under various diverse environmental 
conditions, e.g., when water or nutrients 

are limiting, 
decreasing lodging, 
increasing fruit number and size, 
controlling shading and canopy effects 

Change plant architecture, 
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Phenotype 
Modulated by a 
Gene 



Stem 



Strong 



Weak 
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Utilities 



increase or decrease number of leaves as 

well as change the types of leaves to 

increase biomass, 

improve photosynthetic efficiency, 

create new varieties of ornamental plants 

with enhanced leaf design, 

preventing flowering to opimize vegetative 

growth, 

control of apical dominace, 
increase or decrease flowering time to fit 
season, water or fertilizer schedules, 
change arrangement of leaves on the stem 
(phyllotaxy) to optimize plant density, 
decrease insect feeding, 

or decrease pathogen infection, 
increase number of trichome/glandular 
trichome producing leaves 
targets for herbicides, 
generate ectopic meristems and ectopic 
growth of vegetative and floral tissues and 
seeds and fruits 

modify lignin content/composition for 
creation of harder woods or reduce 
difficulty/costs in pulping for 

paper production or increase 
digestibility of forage crops, 
decrease lodging, 

modify cell wall polysaccharides in stems 
and fruits for improved texture and 
nutrition, 
increase biomass 



Late/Early Bolting 



Break the need for long vernalization of 
vernalization-dependent crops, e.g., winter 
wheat, thereby increasing yield 
decrease or increase generaton time 
increase biomass 



Lethals 



Embryo-lethal 



produce seedless fruit, 
use as herbicide targets 
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Phenotype Utilities 

Modulated by a 

Gene 




Pigment 



Embryo-defective 
Seedling 
Pigment-lethals 
Dark Green 



m 



I1J 



YGVl 



YGV2 



YGV3 



Interveinal chlorosis 



produce seedless fruit, 
use as herbicide targets 

use as herbicide targets, 

useful for metabolic engineering, 

use as herbicide targets, 

increase photosynthetic efficiency 

Increase nutritional value, 

enhanced photosynthesis and carbon 

dioxide combustion and therefore increase 

plant vigor and biomass, 

enhanced photosynthetic efficiency and 

therefore increase plant vigor and biomass, 

prolong vegetative development, 
enhanced protection against pathogens, 

Useful as targets for herbicides, 
increase photosynthetic efficiency and 
therefore increase plant vigor and biomass, 

Useful as targets for herbicides, 

control of change from embryonic to adult 

organs, 

increase metabolic efficiency, 
increase photosynthetic efficiency and 
therefore increased plant vigor and biomass, 

Useful as targets for herbicides, 
nitrogen sensing/uptake/usage, 
increase metabolic efficiency and therefore 
increased plant vigor and biomass, 

to increase photosynthetic efficiency and 
therefore increase plant vigor and biomass 

to increase or decrease nitrogen transport 
and therefore increase plant vigor and 
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Phenotype 
Modulated by a 
Gene 



Roots 



Short (primary root) 



Thick (primary root) 



Branching (primary 
root) 



Long (lateral roots) 



Agravitropic 

Curling (primary root) 
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Utilities 



biomass 

use as herbicide targets 
increase metabolic efficiency, 

to access water from rainfall, 

to access rhizobia spray application, for 

anaerobic soils, 

useful to facilitate harvest of root crops, 

useful for increasing biomass of root crops, 
for preventing plants dislodging during 
picking and harvesting, 
as root grafts, for animal feeds 

modulation allows betters access to water, 
minerals, fertilizers, rhizobia prevent soil 
erosion,s 

increasing root biomass 
decrease root lodging, 

modulation allows improved access to 

water, nutrients, fertilizer, rhizobia, prevent 

soil erosion 

increase root biomass 

decrease root lodging 

modulation allows control on the depth of 

root growth in soil to access water and 

nutriennts 

modulation allows hormonal control of root 
growth and development (size) 

modulation allows control on the depth of 
root growth in soil 

modulation allows hormonal control of root 

growth and development (size) 

useful in anaerobic soils in allowing roots 

to stay close to surface 

harvesting of root crops 



Poor germination 
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Phenotype 
Modulated by a 
Gene 
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Utilities 



Trichome 



Wax mutants 



Cotyledons 



Reduced Number 
Glabrous 

Increased Number 



Genes useful for decreasing transpiration, 
increased production of glandular trichomes 
for oil or other secreted chemicals of value, 

use as deterrent for insect herbivory and 
ovipostion 

modulation will increase resistance to UV 
light, 

decrease insect herbivory and oviposition, 
compostion changes for the cosmetics 
industry, 

decrease transpiration, 
provide pathogen resistance, 
UV protection, 

modulation of leaf runoff properties and 
improved access for herbicides and 
fertilizers 

modulation of seeds structure in legumes, 

increase nutritional value, 

improve seedling competion under field 

conditions, 



Seeds 



Flowers 
Hypocotysl 



Transparent testa 

Light 
Dark 

Other 

Long 



genes useful for metabolic engineering 
anthocyanin and flavonoid pathways 
improved nutritional content 

decrease petal abscission 
decrease pod shattering 

to improve germination rates 
to improve plant survivability 



Short 



to improve germination rates 
to improve plant survivability 
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VII. ENHANCED FOODS 

Animals require external supplies of amino acids that they cannot synthesize themselves. 
Also, some amino acids are required in larger quantities. The nutritional values of plants for 
animals and humans can thus be modified by regulating the amounts of the constituent amino 
acids that occur as free amino acids or in proteins. For instance, higher levels of lysine and/or 
methionine would enhance the nutritional value of corn seed. Applicants herein provide several 
methods for modulating the amino acid content: 

(1) expressing a naturally occurring protein that has a high percentage of the desired 
amino acid(s); 

(2) expressing a modified or synthetic coding sequence that has an enhanced 
percentage of the desired amino acids; or 

(3) expressing the protein(s) that are capable of synthesizing more of the desired 
amino acids. 

A specific example is expressing proteins with enhanced, for example, methionine content, 
preferentially in a corn or cereal seed used for animal nutrition or in the parts of plants used for 
nutritional purposes. 

A protein is considered to have a high percentage of an amino acid if the amount of the 
desired amino acid is at least 1% of the total number of residues in a protein; more preferably 2% 
or greater. Amino acids of particular interest are tryptophan, lysine, methionine, phenylalanine, 
threonine leucine, valine, and isoleucine. Examples of naturally occurring proteins with a high 
percentage of any one of the amino acid of particular interest are listed in the Enhanced Amino 



The sequence(s) encoding the selected protein(s) are operably linked to a promoter and 
other regulatory sequences and transformed into a plant as described below. The promoter is 
chosen optimally for promoting the desired level of expression of the protein in the selected 
organ e.g. a promoter highly functional in seeds. Modifications may be made to the sequence 
encoding the protein to ensure protein transport into, for example, organelles or storage bodies or 
its accumulation in the organ. Such modifications may include addition of signal sequences at or 
near the N terminus and amino acid residues to modify protein stability or appropriate 



Table. 
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glycosylation. Other modifications may be made to the transcribed nucleic acid sequence to 
enhance the stability or translatability of the mRNA, in order to ensure accumulation of more of 
the desired protein. Suitable versions of the gene construct and transgenic plants are selected on 
the basis of, for example, the improved amino acid content and nutritional value measured by 
standard biochemical tests and animal feeding trials. 
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VIII. USE OF NOVEL GENES TO FACILITATE EXPLOITATION OF PLANTS AS 
FACTORIES FOR THE SYNTHESIS OF VALUBLE MOLECULES 

5 Plants and their constituent cells, tissues, and organs are factories that manufacture small 

organic molecules such as sugars, amino acids, fatty acids, vitamins, etc., as well as 
macromolecules such as proteins, nucleic acids, oils/fats and carbohydrates. Plants have long 
been a source of pharmaceutically beneficial chemicals; particularly, the secondary metabolites 
and hormone-related molecules synthesized by plants. Plants can also be used as factories to 
10 produce carbohydrates or lipids that comprises a carbon backbone useful as precursors of 
O plastics, fiber, fuel, paper, pulp, rubber, solvents, lubricants, construction materials, detergents, 
. % and other cleaning materials. Plants can also generate other compounds that are of economic 

value, such as dyes, flavors, and fragrances. Both the intermediates as well as the end-products 
fll of plant bio-synthetic pathways have been found useful. 

i; j|5 With the polynucleotides and polypeptides of the instant invention, modification of both in- vitro 

and in-vivo synthesis of such products is possible. One method of increasing the amount of 
05 either the intermediates or the end-products synthesized in a cell is to increase the expression of 
one or more proteins in the synthesis pathway as discussed below. Another method of increasing 



3 production of an intermediate is to inhibit expression of protein(s) that synthesize the end- 
20 product from the intermediate. Levels of end-products and intermediates can also be modified 
by changing the levels of enzymes that specifically change or degrade them. The kinds of 
molecules made can be also be modified by changing the genes encoding specific enzymes 
performing reactions at specific steps of the biosynthetic pathway. These genes can be from the 
same or a different organism. The molecular structures in the biosynthetic pathways can thus be 
25 modified or diverted into different branches of a pathway to make novel end-products. 

Novel genes comprising selected promoters and sequences encoding enzymes are 
transformed into the selected plant to modify the levels, composition and/or structure of, without 
limitation: 

Terpenoids 
30 - Alkaloids 

Hormones, including brassinosteriods 
Flavonoids 
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Steroids 

Vitamins such as 
Retinol 
Riboflavin 
5 - Thiamine 

Caffeine 

Morphine and other alkaloids 

Peptides and amino acid synthesis 

Antioxidants 
iJP - Starches and lipids 

: S - Fatty acids 

- Fructose, mannose and other sugars 

y~i 

9"? - Glycerolipid 

IU 

- Citric acid 
Jll 5 - Lignin 

ffl - Flavors 

111 

"f* - Fragrances 

O - Essential oils 

Colors or dyes 
20 - Gum 

Gels 

Waxes 

The modifications are made by designing one or more novel genes per application 
25 comprising promoters, to ensure production of the enzyme(s) in the relevant cells, in the right 

amount, and polynucleotides encoding the relevant enzyme. The promoters and polynucleotides 
are the subject of this application. The novel genes are transformed into the relevant species 
using standard procedures. Their effects are measured by standard assays for the specific 
chemical/biochemical products. 
30 These polynucleotides and proteins of the invention that participate in the relevant 

pathways and are useful for changing production of the above chemicals and biochemicals are 
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identified in the Reference tables by their enzyme function. More specifically, proteins of the 
invention that have the following enzymatic activities are of interest to modulate the 
corresponding pathways to produce precursors or final products noted above that are of industrial 
use. Biological activities of particular interest are listed below. 

Other polynucleotides and proteins that regulate where, when and to what extent a 
pathway is active in a plant are extremely useful for modulating the synthesis and accumulation 
of valuable chemicals. These elements including transcription factors, proteins involved in 
signal transduction and other proteins in the control of gene expression are described elsewhere 
in this application. 
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Pathway 


Enzyme 


Comments 


Name 


Description 




Alkaloid 
biosynthesis I 


Morphine 6- 
dehydrogenase 


Also acts on other alkaloids, including 
codeine, normorphine and 
ethylmorphine, but only very slowly 
on 7,8-saturated derivatives such as 
dihydromorphine and dihydrocodeine 
In the reverse direction, also reduces 
naloxone to the 6-alpha- hydroxy 
analog Activated by 2- 
mercaptoethanol 




Codeinone 

reductase 

(NADPH) 


Stereospecifically catalyses the 
reversible reduction of codeinone to 
codeine, which is a direct precursor of 
morphine in the opium poppy plant, 
Papaver somniferum 




Salutaridine 

reductase 

(NADPH) 


Stereospecifically catalyses the 
reversible reduction of salutaridine to 
salutaridinol, which is a direct 
precursor of morphinan alkaloids in 
the poppy plant, Papaver somniferum 




(S)-stylopine 
synthase 


Catalyses an oxidative reaction that 
does not incorporate oxygen into the 
product Forms the second 
methylenedioxy bridge of the 
protoberberine alkaloid stylopine from 
oxidative ring closure of adjacent 
phenolic and methoxy groups of 
cheilanthifoline 




(S)-cheilanthifoline 
synthase 


Catalyses an oxidative reaction that 
does not incorporate oxygen into the 
product Forms the methylenedioxy 
bridge of the protoberberine alkaloid 
cheilanthifoline from oxidative ring 
closure of adjacent phenolic and 
methoxy groups of scoulerine 




Salutaridine 
synthase 


Forms the morphinan alkaloid 
salutaridine by intramolecular phenol 
oxidation of reticuline without the 
incorporation of oxygen into the 
product 
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( Sl^-ranaoMne 

synthase 


K^dvaiy oto ail vj/viiictii vc iCdL-LUJii Liia.1 

does not incorporate oxygen into the 
nroduct Oxidation of the 
methoxyphenol group of the alkaloid 
tetrahydrocolumbamine results in the 
formation of the methylenedioxy 
bridge of canadine 




Protopine 6- 
monooxygenase 


Involved in benzophenanthridine 
alkaloid synthesis in higher plants 




Dihvdrosansuinarin 
e 10- 

monooxygenase 


Involved in nenzonhenanthridine 

XXX V \JX V Ks\X 111 LJ\*lX£-i\J LSllWllCUlllil 

alkaloid synthesis in higher plants 




Monophenol 
monooxygenase 


A group of copper proteins that also 
catalyse the reaction of EC 1.10.3.1, if 
only 1 ,2-benzenediols are available as 
substrate 




L-amino acid 
oxidase 






1,2- 

n p n vH Torf* 1 1 r* 1 1 H n 1 1 1 

uvll y vll \JL c tic 111 1111 Ll 

m reductase 
(NADPH) 


Stereospecifically reduces the 1,2- 
uciiyurureiicuiiniurn ion 10 ^rv )- 
reticuline, which is a direct precursor 
of morphinan alkaloids in the poppy 
plant, papaver somniferum The 
enzyme does not catalyse the reverse 
reaction to any significant extent 
under nTwsiolopiral conditions 




Dihydrobenzophena 
nthridine oxidase 


Also catalyzes: dihydrochelirubine + 
0(2) = chelirubine + H(2)0(2) Also 
catalyzes: dihydromacarpine + 0(2) = 
macarpine + H(2)0(2) Found in 
higher plants Produces oxidized forms 
of the benzophenanthridine alkaloids 




Reticuline oxidase 


The product of the reaction, (S)- 
scoulerine, is a precursor of protopine, 
protoberberine and 
benzophenanthridine alkaloid 
biosynthesis in plants Acts on (S)- 
reticuline and related compounds, 
converting the N- methyl group into 
the methylene bridge fberberine 
bridge[PRIME]) of (S)- 
tetrahydroprotoberberines 
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m 





3[PRIME]- 
hydroxy-N-methyl- 
f SVcoclaurine 
4[PRIME]-0- 
methyltransferase 


Involved in isoquinoline alkaloid 
metabolism in plants Has also been 

cV»nwn to patalv^f* thf* mptViviati nn nf* 

(R,S> laudanosoline, (S)-3[PRIME]- 
hydroxycoclaurine and (R 9 S)-7-0- 
methylnoraudanosoline 




(S)-scoulerine 9-0- 

m ptVi v 1 tra n ^fprpi ^p 

l-ll^ 111 J 1 LI ClllkJ-L^l dOV 


The product of this reaction is a 

rvrppnt*Qor "for T^rotoH^rhf^ri np silLrtilniHc 

in plants 




Columbamine 0- 
methyltransferase 


The product of this reaction is a 
protoberberine alkaloid that is widely 

Hi ^tri Vuitprl in thp r-\1 ont Lr i n cj r\r\m 

Distinct in specificity from EC 
2.1.1.88 




10- 

HvHrnvvHiVivHroQfin 

1 yj vll UAy U-lliy KJLL UoUl 1 

guinarine 10-O- 
methyltransferase 


Part of the pathway for synthesis of 
uciizAjpiicii<xiiLiii iLiiiic ctiKctiuicio in 
plants 




12- 

hydroxydihydrochel 
irubine 12-0- 
methyltransferase 


Part of the pathway for synthesis of 
benzophenanthridine alkaloid 
macarpine in plants 




(R 3 S)-norcoclaurine 
methyltransferase 


Norcoclaurine is 6,7-dihydroxy-l-[(4- 
nyaroxypnenyi ^metnyij- 1 ,z, 3,4- 
tetrahydroisoquinoline The enzyme 
will also catalyse the 6-O-methylation 
of (R ? S)- norlaudanosoline to form 6- 
O-methyl-norlaudanosoline, but this 
alkaloid has not been found to occur in 




Salutaridinol 7-0- 
acetyltransferase 


At higher pH values the product, 7-0- 
acetylsalutaridinol, spontaneously 
closes the 4->5 oxide bridge by allylic 

pliminfition to form tHp mnmhitif 

V^lllllllICtLlV^ll Lu J.\J1111 lllV L/lllllt 

precursor thebaine From the opium 
poppy plant, Papaver somniferum 




Aspartate 
aminotransferase 


Also acts on L-tyrosine, L- 
phenylalanine and L-tryptophan. This 
activity can be formed from EC 
2.6.1.57 by controlled proteolysis 




Tyrosine 
aminotransferase 


L-phenylalanine can act instead of L- 
tyrosine The mitochondrial enzyme 
may be identical with EC 2.6.1.1 The 
three isoenzymic forms are 
interconverted by EC 3.4.22.4 
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Arnmfitif* inn 

acid transferase 


-L/ 111CL111VJ11111C Ivdll clloU all cli U.U11U1, 

more slowly Oxaloacetate can act as 
acceptor Controlled proteolysis 
converts the enzvme to KC 1 2 6 11 




Tyrosine 
decarboxylase 


The bacterial enzyme also acts on 3- 
hydroxytyrosine and, more slowly, on 
3 -h vdrox vnhen vl a 1 anine 




Aromatic-L-amino- 
acid decarboxylase 


Also acts on L-tryptophan, 5-hydroxy- 
T-trvntonhan and dihvdroxv- T,- 

i—/ u y u i v L/iiuii ciiiu V-Iiii y vi i ua y j i 

phenylalanine (DOPA) 


Alkaloid 

biosynthesis 

II 


Tropine 
dehydrogenase 


Oxidizes other tronan-^-alriha-nls but 

V/AlUliLViJ \J 111V1 LI \J L/CU1 _/ C1.1L/11CI UlOj l/Ul 

not the corresponding beta- 
derivatives 




Troninone 
reductase 






Hyoscyamine (6S)- 
dioxygenase 






6-beta- 

11 j ui uAy iiy Uocy ctiiii 

ne epoxidase 






A mirsp hyi H q cp* 
XTJ.11111C vJAlUcloC 

(copper-containing) 


t\. group oi enzyineb lnciuaing xnose 
oxidizing primary amines, diamines 

Jinn hi Qt^minp Otip* fr^rm nf FP 

<Xx\\X llloLClillillV^ v^llC lljllll Ul i a ^ 

1.3.1.15 from rat kidney also catalyses 
this reaction 




Putrescine N- 
methvltransferase 






Ornithine 
decarboxylase 






Oxalyl-CoA 
decarbox vl ase 






Phenylalanine 
ammonia-lyase 


May also act on L-tyrosine 


Androgen and 

estrogen 

metabolism 


3-hptp-hvHrnw- 

delta(5)-steroid 
dehydrogenase 


>rt.L/Lo \JLL J-UClct-11^ illvJAj/clllLll Uol Cll 

17-one to form androst-4-ene- 3,17- 
dione and on 3-beta-hydroxypregn-5- 
en-20-one to form progesterone 




11-beta- 

hydroxysteroid 

ueiiyurogendise 






Estradiol 17-alpha- 
dehydrogenase 






3-alpha-hydroxy-5- 
beta-androstane- 1 7- 
one 3 -alpha- 
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dehydrogenase 






3-alpha(17-beta)- 
hydroxysteroid 
dehydrogenase 
(NAD+) 


Also acts on other 1 7-beta- 
hydroxy steroids, on the 3-alpha- 
hydroxy group of pregnanes and bile 
acids, and on benzene dihydrodiol 
Different from FC 1 1 1 SO or FC 
1.1.1.213 




3-alpha- 

h vHro w<\tprni c\ 

dehydrogenase (B- 
specific) 


Acts on other 3-alpha-hydroxysteroids 

pidH on Q- 1 1 - nnH 1 S- 

hydroxyprostaglandin B-specific with 
respect to NAD(+) or NADP(+) (cf. 
EC 1.1.1.213) 




Ifor 1 7^heta- 

hydroxysteroid 

dehydrogenase 


A 1cr\ Jir*tQ on ntnpr ^-ripta- c\v 1 7-Hr1~£i- 
rviou uvio un uuiti _? uc la ui 1 / L/C/ lei 

hydroxysteroids (cfEC 1.1.1.209) 




Estradiol 1 7 beta- 
dehydrogenase 


Also acts on (S)-20-hydroxypregn-4- 
en-3-one and related compounds, 
oxidizing the (S)-20-group B-specific 
with respect to NAD(P)(+) 




Testosterone 17- 
beta-dehydrogenase 






TpQtn<stprr>np 1 7- 

beta-dehydrogenase 
(NADP+'l 


A 1 OA AY1 f\ 1 7PC *X _nA7/ r lT*r\V'\/riRYr*V\Qt*V*1"f 51 1 

to 3-oxohexobarbital 




Steroid 1 1-beta- 
monooxygenase 


Also hydroxylates steroids at the 1 8- 
position, and converts 1 8- 
hydroxycorticosterone into 
aldosterone 




Estradiol 6 -beta- 
monooxygenase 






Androst-4-ene- 

3,17-dione 

monooxygenase 


Has a wide specificity A single 
enzyme from Cylindrocarpon 
radicicola (EC 1.14.13.54) catalyses 
both this reaction and that catalysed 
by EC 1.14.99.4 




3 -oxo- 5 -alpha- 
steroid 4- 
dehydrogenase 






3-oxo-5-beta- 
steroid 4- 
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dehydrogenase 






UDP- 

glucuronosyltransfe 
rase 


Family of enzymes accepting a wide 
range of substrates, including phenols, 
alcohols, amines and fatty acids Some 
of the activities catalysed were 

picviuusiy iioic-u oCjJciiciiciy do JZiV^ 

2.4.1.42, EC 2.4.1.59, EC 2.4.1.61, EC 
2 4 1 76 FC 2 4 1 77 FC 2 4 1 R4 FC 
2.4.1.107 and EC 2.4.1.108 A 
temporary nomenclature for the 
various forms whose delineation is in 
a state of flux 




Steroid 

sul fotran sfera se 

kJ HI .1 vy 11. CUlOlvl Clow 


Broad specificity resembling EC 

9x99 nut Ftl^r* Jir*tQ c\x\ PQt"t*r*n£* 




Alcohol 

sul fotran sfera se 


Primary and secondary alcohols, 

inpluninO' almViatir* pilpnlinlQ fi^rTirhatf* 

chloramphenicol, ephedrine and 
hvdroxv steroids but not nhprtolio 
steroids, can act as acceptors (cf. EC 
2.8.2.15) 




Estrone 

sulfotransferase 






Arylsulfatase 


A group of enzymes with rather 
similar specificities 




Steryl-sulfatase 


Also acts on some related steryl 
sulfates 




1 7-alpha- 

hydroxyprogesteron 
e aldolase 






Steroid delta- 
isomerase 




C21 -Steroid 

hormone 

metabolism 


3 -beta-hy droxy- 

delta(5)-steroid 

dehydrogenase 


Acts on l-beta-hvdroxvandrost-5-en- 
17-one to form androst-4-ene- 3,17- 
dione and on 3 -beta-hy droxypregn-5- 
en-20-one to form progesterone 




11-beta- 

hydroxysteroid 

dehydrogenase 






20-alpha- 

hydroxysteroid 

dehydrogenase 


A-specific with respect to NAD(P)(+) 




3-alpha- 
hydroxysteroid 
dehydrogenase (B- 


Acts on other 3 -alpha-hydroxy steroids 
and on 9-, 11- and 15- 
hydroxyprostaglandin B-specific with 



611 



• 


• 

Attorney Docket I 




specific) 


respect to NAD(+) or NADP(+) (cf. 
EC 1.1.1.213) 




3-alpha(or 20-beta)- 

hydroxysteroid 

dehydrogenase 


The 3-alpha-hydroxyl group or 20- 

DcLd-iiyuroxyi group 01 pregnane diiu 
androstane steroids can act as donors 




Steroid 1 1 -beta- 
monooxygenase 


Also hydroxylates steroids at the 1 8- 
position, and converts 1 8- 
nyuroAycoriiLOMcronc niiu 
aldosterone 




Corticosterone 18- 
monooxygenase 






Cholesterol 
monooxygenase 
(side-chain 
cleaving) 


The reaction proceeds in three stages, 
with hydroxylation at C-20 and C-22 
prcceuing scission oi ine MCie-Lndin ai 
C-20 




Steroid 21- 
monooxygenase 






Progesterone 11- 
alpha- 

monooxygenase 






Steroid 17-alpha- 
monooxygenase 






Cholestenone 5- 
beta-reductase 






Cortisone beta- 
reductase 






Progesterone 5- 
alpha-reductase 


Testosterone and 20-alpha-hydroxy-4- 

pregnen- j-une con aci in piacc 01 
progesterone 




3-oxo-5-beta- 
steroid 4- 
dehydrogenase 






Steroid delta- 
isomerase 




Flavonoids, 
stilbene and 
lignin 

biosynthesis 


Coniferyl-alcohol 
dehydrogenase 


Specific for coniferyl alcohol; does 
not act on cinnamyl alcohol, 4- 

COUlIlaiyi alCOIlOl Ul Mllcipyi al L-OIlOl 




Cinnamyl-alcohol 
dehydrogenase 


Acts on coniferyl alcohol, sinapyl 
alcohol, 4-coumaryl alcohol and 
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cinnamyl alcohol (cf. EC 1.1.1.194) 




Dihydrokaempferol 
4-reductase 


Also acts, in the reverse direction, on 
(+)-dihydroquercetin and (+)- 
dihvdromvricetin Each 

Villi T VAA V/ilA J X till J^/C^vll 

dihydroflavonol is reduced to the 
corresponding cis-flavon- 3,4-diol 
NAD(+) can act instead of NADP(+) ? 
more slowly Involved in the 
biosynthesis of anthocyanidins in 
plants 




Flavonone 4- 
reductase 


Involved in the biosynthesis of 3- 
deoxyanthocyanidins from flavonones 
such as naringenin or eriodictyol 




Peroxidase 






Caffeate 3,4- 
dioxygenase 






Naringenin 3- 
dioxygenase 






Trans-cinnamate 4- 
monooxygenase 


Also acts on NADH, more slowly 




Trans-cinnamate 2- 
monooxygenase 






Flavonoid 

3[PRIME]- 

monooxygenase 


Acts on a number of flavonoids, 
including naringenin and 
dihydrokaempferol Does not act on 4- 
coumarate or 4-coumaroyl-CoA 




Monophenol 
monooxygenase 


A group of copper proteins that also 

catalyse the reaction of EC 1.10.3.1, if 

11^1 j*i *iii 
only 1 ,2-benzenediols are available as 

substrate 




Cinnamoyl-CoA 
reductase 


A 1 A. 1 X* 1 A.' A. A. J 

Also acts on a number of substituted 
cinnamoyl esters of coenzyme A 




caneoyi-L,oA kj- 
methyltransferase 






Luteolin O- 
methyltransferase 


Also acts on luteolin-7-O-beta-D- 
glucoside 




Caffeate O- 
methyltransferase 


3,4-dihydroxybenzaldehyde and 
catechol can act as acceptor, more 
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slowly 




Apigenin 

4[PRIME]-0- 

methyltransferase 


Converts apigenin into acacetin 
Naringenin (5,7,4[PRIME]- 
trihydroxyflavonone) can also act as 
acceptor, more slowly 




Quercetin 3-0- 
methyltransferase 


Specific for quercetin. Related 
enzymes bring about the 3-0- 
methylation of other flavonols, such as 
galangin and kaempferol 




Isoflavone-7-O- 
beta-glucoside 
6[PRIME][PRIME] 
-O- 

malonyltransferase 


The 6-position of the glucose residue 
of formononetin can also act as 
acceptor Some other 7-O-glucosides 
of isoflavones, flavones and flavonol s 
can also act, more slowly 




Pinosylvin synthase 


Not identical with EC 2.3.1.74 or EC 
2.3.1.95 




Naringenin- 
chalcone synthase 


In the presence of NADH and a 
reductase, 6[PRIME]-deoxychalcone 
is produced 




Trihydroxystilbene 
svnthase 


Not identical with EC 2.3.1.74 or EC 
2.3.1.146 




Quinate O- 

hydroxycinnamoylt 

ransferase 


Caffeoyl-CoA and 4-coumaroyl-CoA 
can also act as donors, more slowly 
Involved in the biosynthesis of 
chlorogenic acid in sweet potato and, 
with EC 2.3.1.98 in the formation of 
caffeoyl-CoA in tomato 




Coniferyl-alcohol 
glucosyltransferase 


Sinapyl alcohol can also act as 
acceptor 




2-coumarate O- 
beta- 

glucosyltransferase 


Coumarinate (cis-2- 
hydroxycinnamate) does not act as 
acceptor 




Scopoletin 
glucosyltransferase 






Flavonol-3-O- 
glucoside L- 
rhamnosyltransferas 
e 


Converts flavonol 3-0-glucosides to 
3-O-rutinosides Also acts, more 
slowly, on rutin, quercetin 3-0- 
galactoside and flavonol 03- 
rhamnosides 




Flavone 7-O-beta- 
glucosyltransferase 


A number of flavones, flavonones and 
flavonols can function as acceptors 
Different from EC 2.4.1.91 




Flavonol 3-0- 
glucosyltransferase 


Acts on a variety of flavonols, 
including quercetin and quercetin 7-0- 
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glucoside Different from EC 2.4.1 .81 




Flavone 

apiosyltransferase 


7-O-beta-D-glucosides of a number of 
flavonoid<? and of 4- miHstitutpH 
phenols can act as acceptors 




Coniferin beta- 
glucosidase 


Also hydrolyzes syringin, 4-cinnamyl 
alcohol beta-glucoside, and, more 
slowly, some other aryl beta- 
glycosides A plant cell-wall enzyme 
involved in the biosvnthesi^ of lianin 




Beta-glucosidase 


Wide specificity for beta-D- 
glucosides. Some examples also 
hydrolyse one or more of the 
following: beta-D-galactosides, alpha- 
L- arabinosides, beta-D-xylosides 5 and 
beta-D-fucosides 




Chalcone isomerase 






4-coumarate~CoA 
ligase 





m 

m 
iu 
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Pathway Name 


Enzyme 
Description 


Enzyme Comments 


Ascorbate and aldarate 
metabolism 


D-threo-aldose 
1- 

dehydrogenase 


Acts on L-fucose, D- 
arabinose and L- 
xylose The animal 
enzyme was also 
shown to act on L- 
arabinose, and the 
enzyme from 
Pseudomonas 
caryophylli on L- 
glucose 




L-threonate 3- 
dehydrogenase 






Glucuronate 
reductase 


Also reduces D- 
galacturonate May 
be identical with EC 
1.1.1.2 




Glucuronolacton 
e reductase 






L-arabinose 1- 
dehydrogenase 






L- 

galactonolactone 
oxidase 


Acts on the 1 ,4- 
lactones of L- 
galactonic, D- 
altronic, L-fuconic, 
D-arabinic and D- 
threonic acids Not 
identical with EC 
1.1.3.8 (cf. EC 
1.3.2.3) 




L-gulonolactone 
oxidase 


The product 
spontaneously 
isomerizes to L- 
ascorbate 




L-ascorbate 
oxiaase 






L-ascorbate 
peroxidase 






Ascorbate 2,3- 
d (oxygenase 






2,5- 

dioxovalerate 
dehydrogenase 





616 



PATENT 

Attorney Docket No. 2750-1481P 





Aldehyde 

dehydrogenase 

(NAD+) 


Wide specificity, 
includina oxidation 
of D- 

glucuronolactone to 
D-glucarate 




Galactonolacton 
e 

dehydrogenase 


Cf EC 1 1 3 24 




Monodehvd roas 
corbate 
reductase 
(NADH) 






Glutathione 

dehydrogenase 

(ascorbate) 






L- 

arabinonolacton 
ase 






Gluconolactonas 
e 


Acts on a wide 
range of hexono- 
1,5-lactones 




Uronolactonase 






1 ,4-lactonase 


Specific for 1 ,4- 
lactones with 4-8 
carbon atoms Does 
not hydrolyse simple 

acetylcholine, sugar 
lactones or 

^uh^titi jfprl alinh^itip 

OUUOUlUlvU 0 1 1 LJ 1 1 CI LI O 

lactones, e.g. 3- 

hydroxy-4- 

butyrolactone 




2-dehydro-3- 

deoxyglucarate 

aldolase 






L-arabinonate 
dehydratase 






Glucarate 
dehydratase 






5-dehydro-4- 

deoxyglucarate 

dehydratase 






Galactarate 
dehydratase 






2-dehydro-3- 
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HpOYV-l - 
UvUAy L_ 

arabinonate 
dehydratase 




Carbon fixation 


Malate 

dphvd rnnpna^p 


Also oxidizes some 
othpr 9- 

\J LI Id £m 

hyd roxyd ica rboxy lie 
acids 




Malate 

dehydrogenase 

( HppArhnYX/l^tinn 

lUuuu 1 UUA y 1 CI LI 1 IvJ 

) 


Does not 
decarboxylates 

auucu UAalUaUC Lei LC 




Malate 

(oxaloacetate 

decarboxylating) 

(NADP+) 


Also decarboxylates 

dUUcU vJAdlU dot? Idle? 




Malate 

dehydrogenase 


Activated by light 




Glyceraldehyde- 

3-phosphate 

dehydrogenase 

(NADP+) 

(phosphorylating 

) 






Transketolase 


Wide specificity for 
both reactants, e.g. 
converts 
hydroxy pyruvate 
and R-CHO into 
CO(2) and R- 

CH(2)OH 

Transketolase from 
Alcaligenes faecalis 
shows high activity 
with D-erythrose as 
acceptor 




Aspartate 

aminotransferas 

e 


Also acts on L- 
tvrosine L- 
phenylalanine and 
L-tryptophan. This 
activity can be 
formed from EC 
2.6.1.57 by 
controlled 
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proteolysis 




Alanine 

aminotransferas 
e 


2-aminobutanoate 
acts slowly instead 
of alanine 




Sedoheptulokina 
se 






Phosphoribuloki 
nase 






P\/ri iv^tp kinp^p 

1 VI Li V CI l\l 1 ICIOw 


UTP (TTP OTP 
ITP and dATP can 
al^o apt Hnnnr^ 

O 1 0 \J Clvl CIO UUI IUI o 

Also phosphorylates 
hydroxylamine and 
fluoride in the 
presence of CO(2) 




Phosphoglycerat 
e kinase 






Pyruvate, phosph 
ate dikinase 






Fructose- 
bisphosphatase 


The animal enzyme 
also acts on 
sedohpntulo^p 1 7- 
bisphosphate 




Sedoheptulose- 
bisphosphatase 






Phosphoenolpyr 
uvate 

carboxylase 






Ribulose- 

bisphosphate 

carboxylase 


Will utilize 0(2) 
instead of CO(2), 
forminn 3-nhn^nhn- 

D-glycerate and 2- 
phosphoglycolate 




Phosphoenolpyr 
uvate 

carboxykinase 
(ATP) 






Fructose- 

bisphosphate 

aldolase 


Also acts on 
(3S,4R)-ketose 1- 
phosphates The 
yeast and bacterial 
enzymes are zinc 
proteins The 
enzymes increase 
electron-attraction 
by the carbonyl 
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group, some (Class 
i) Torming a 
protonated imine 

\A#ith it nthorc /PIqco 
Willi II, Ulllcio \L/ldoo 

II), mainly of 
microbial origin, 
polarizing it with a 
metal ion, e.g zinc 




Phosphoketolas 






Ribulose- 
epimerase 


Also converts D- 

ciylllfUoe H- 

phosphate into D- 
erythrulose 4- 
phosphate and D- 
threose 4-phosphate 




isomerase 






Ribose 5- 

phosphate 

epimerase 


Also acts on D- 
ribose 5- 

diphosphate and D- 
noose o- 
triphosphate 


Phenylalanine metabolism 


(R)-4- 

hydroxyphenylla 
ctate 

dehydrogenase 


Also acts, more 
slowly, on (R)-3- 
phenyllactate, (R)-3- 
(indole-3- yl)lactate 
ana vr\^-iaciaie 




Hydroxyphenylp 

yruvate 

reductase 


Also acts on 3-(3,4- 
dihydroxyphenyl)lact 
ate Involved with EC 
2.3.1.140 in the 
biosynthesis of 
rosmarinic acid 




Aryl-alcohol 
dehydrogenase 


A group of enzymes 
with broad specificity 
towards primary 
aiconois wim an 

al Ul 1 IdUO Ul 

cyclohex-1-ene rinq, 
but with low or no 
activity towards 
short-chain aliphatic 
alcohols 




D eroxidase 






Catechol 1,2- 


Involved in the 
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dioxygenase 


metabolism of nitro- 
aromatic 
compounds by a 

ofr^iin of 

Oil Oil 1 vl 

Pseudomonas 
putida 




2,3- 

HihvH rriY\/h^n7na 

\Jll lyUI UAy U\5\ l^KJa 

te 3,4- 
dioxygenase 






3- 

carboxyethylcate 
chol 2,3- 
dioxygenase 






Catechol 2,3- 
dioxygenase 


The enzyme from 
Alcaligines sp. strain 
0-1 has also been 
shown to catalyse 
the reaction: 3- 
Sulfocatechol + 0(2) 
+ H(2)0 = 2- 
hydroxymuconate + 
bisulfite. It has been 
rererrea to as o- 
sulfocatechol 2,3- 
dioxygenase. 
Further work will be 

noroccQ rw f/"\ chn\A/ 

whether or not this is 

C* Uloill list CI \£-y 1 1 IC 




4- 

hydroxyphenylpy 
ruvate 

dioxygenase 






Protocatechuate 
3,4-dioxygenase 






Hydroxyquinol 
1 ,2-dioxygenase 


The product 
isomerizes to 2- 
maleylacetate (cis- 
hex-2-ened ioate) 
Highly specific; 
catechol and 
pyrogallol are acted 
on at less than 1 % 
of the rate at which 
benzene-1,2,4-triol 
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is oxidized 




Protocatechuate 
4,5-dioxygenase 






Phpnvlalaninp 2- 

1 1 Id Ijf lulul Ml £— 

monooxygenase 


mow vaiciiyovo ci 

reaction similar to 
that of EC 1.4.3.2, 
forming 3- 
ohenvlovruvate 
NH(3) and H(2)0(2), 
but more slowly 




Anthranilate 1,2- 
dioxygenase 
(deaminating, 
decarboxvIatincA 






Benzoate 1 ,2- 
dioxygenase 


A system, containing 
a reductase which is 
an iron-sulfur 
flavoprotein (FAD), 
and an iron-sulfur 
oxygenase 




Toluene 
dioxygenase 


A system, containing 
a reductase which is 
an iron-sulfur 
flavoprotein (FAD), 
an iron-sulfur 
oxygenase, and a 
ferredoxin Some 
other aromatic 
compounds, 
including 

xylene and some 
halogenated 
toluenes, are 
converted into the 
corresponding cis- 
dihydrodiols 




Nanhthalene 

i iuui i li mivi iv 

1,2-dioxygenase 


A ^v^tpm nontaininn 

/ \ oy o lw ill, ww i i ici ii in iu 

a reductase which is 
an iron-sulfur 
flavoprotein (FAD), 
an iron-sulfur 
oxygenase, and 
ferredoxin 




Benzene 1,2- 
dioxygenase 


A system, containing 
a reductase which is 
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an iron-sulfur 
fiavoprotein, an iron- 
sulfur oxygenase 
and ferredoxin 




Salicylate 1- 
monooxygenase 






Trans-cinnamate 
4- 

monooxygenase 


Also acts on NADH, 
more slowly 




Rpn^nafp 4- 

monooxygenase 






4- 

hydroxybenzoat 
e3- 

monooxygenase 


Most enzymes from 
Pseudomonas are 
highly specific for 
NAD(P)H (cf EC 
1 14 13 33^ 




3- 

hydroxybenzoat 
e4- 

monooxygenase 


Also acts on a 
number of analogs 
of 3- 

hydroxybenzoate 
substituted in the 2, 
4, 5 and 6 positions 




3- 

hydroxybenzoat 
monooxygenase 


Also acts on a 
number of analogs 

Ul o- 

hydroxybenzoate 
substituted in the 2, 
4, 5 and 6 positions 
NADPH can act 
instead of NADH, 
more slowly 




4- 

hydroxybenzoat 
e3- 

monooxygenase 
(NAD(P)H) 


The enzyme from 
Corynebacterium 
cyclohexanicum is 
highly specific for 4- 
hydroxybenzoate, 

hut iiqp^ MADH anrl 

NADPH at 
approximately equal 
rates (cf. EC 
1.14.13.2). It is less 
specific for NADPH 
than EC 1.14.13.2 




Anthranilate 3- 

monooxygenase 

(deaminating) 


The enzyme from 
Aspergillus niger is 
an iron protein; that 
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from the yeast 
Trichosporon 
cutaneum is a 
flavoprotein (FAD) 




Melilotate 3- 
monooxygenase 






Phenol 2- 
monooxygenase 


Also active with 
resorcinol and O- 
cresol 




Mandelate 4- 
monooxygenase 






3- 

hydroxybenzoat 
e2- 

monooxygenase 






4-cresol 
dehydrogenase 

( h\/H roYx/lati nn\ 
lyui u Ay lain iy y 


Phenazine 
methosulfate can act 

do dUUc^piUI r\ 

quinone methide is 
probably formed as 

int^rm^Hiato The* 

II ltd 1 1 ICVJ la LC 1 1 IC 

product is oxidized 

further to 4- 

hvd roxvbenzoatp 




Benzaldehyde 

dehydrogenase 

(NAD+) 






Aminomuconate- 

semialdehyde 

dehydrogenase 


Also acts on 2- 

hydroxymuconate 

semialdehyde 




Phenylacetaldeh 
yde 

dehydrogenase 






4-carboxy-2- 

h\/H rovvmi jponat 
e-6- 

semialdehyde 
dehvd roaenase 


Does not act on 

U 1 loUUoU LU LUU 

aliphatic or aromatic 
aldehydes or 
alucose NADf+^ can 
replace NADP(+), 
but with lower 
affinity 




Aldehyde 

dehydrogenase 

(NAD(P)+) 






Benzaldehyde 
dehydrogenase 
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Coumarate 
reductase 






Cis-1,2- 

dihydrobenzene- 
1,2-diol 






Cis-1,2-dihydro- 
1,2- 

dihydroxynaphth 
alene 

dehydrogenase 


Also acts, at half the 
rate, on cis- 
anthracene 
dihydrodiol and cis- 
phenanthrene 
dihydrodiol 




2-enoate 
reductase 


Acts, in the reverse 
direction, on a wide 
range of alkyl and 
aryl alpha, beta- 
unsaiurateo 
carboxylate ions 2- 
butenoate was the 
best substrate 
tested 




Maleylacetate 
reductase 






Phenylalanine 
uenya rogenase 


The enzyme from 
Bacillus badius and 
Sporosarcina ureae 
are nigniy specmc 
for L-phenylalanine, 
that from Bacillus 
sphaericus also acts 

yji i l_ ly i uoii ic 




L-amino acid 
oxidase 






Amine oxidase 
(flavin- 
containing) 


Acts on primary 
amines, and usually 
also on secondary 
and tertiary amines 




Amine oxidase 

(copoer- 

containing) 


A group of enzymes 
includina those 
oxidizing primary 
amines, diamines 
and histamine One 
form of EC 1.3.1.15 
from rat kidney also 
catalyses this 
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reaction 




D-amino-acid 
dehydrogenase 


Acts to some extent 
on all D-amino acids 
except D-aspartate 
and D-glutamate 




Aralkylamine 
dehydrogenase 


Phenazine 
methosulfate can act 
as acceptor Acts on 
aromatic amines 
and, more slowly, on 
some long-chain 

dlipi tdUO cui initio, UUl 

not on methylamine 

nr pthx/lpmin^ fof FP 

u» en iy icu I it I ic i_ w 

1.4.99.3) 




Glutamine N- 

Dhenvlacptvltran 

sferase 






Acetyl-CoA C- 
acyltransferase 






D-flminpi-ar k iH M- 

acetyltransferas 
e 






Php*n\/l?ilanin*a 
i i ici lyiciicii in ic 

N- 

acetyltransferas 
e 


miou dus, more 
slowly, on L-histidine 
and L-alanine 




Glycine N- 

benzoyltransfera 

se 


Not identical with EC 
2.3.1.13 or EC 
2.3.1.68 




Aspartate 
aminotransferas 


Also acts on L- 
tyrosine, L- 
pnenyiaianine ana 
L-tryptophan. This 
activity can be 
formed from EC 
2.6.1.57 by 
controlled 
Droteolysis 




D-alanine 

aminotransferas 

e 


Acts on the D- 

isomers of leucine, 

aspartate, 

glutamate, 

aminobutyrate, 

norvaline and 

asparagine 
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Tyrosine 

aminotransferas 

e 


L-phenylalanine can 
act instead of L- 
tyrosine The 
mitochondrial 
enzyme may be 
identical with EC 
2.6.1.1 The three 
isoenzymic forms 
are interconverted 
by EC 3.4.22.4 




Aromatic amino 
acid transferase 


L-methionine can 
also act as donor, 

i i i\Ji oiuvviy 

Oxaloacetate can 
act as acceptor 
Controlled 
nroteolv^i^ ctonx/prte 
the enzyme to EC 
2.6.1.1 




Histidinol- 
ohosDhate 
aminotransferas 
e 






3-oxoadipate 
CoA-transferase 






3-oxoadipate 
enol-lactonase 


Acts on the product 
of EC 4.1.1.44 




Carboxymethyle 
nebutenolidase 






2-pyrone-4,6- 
dicarboxylate 
lactonase 


The product 
isomerizes to 4- 
oxalmesaconate 




Hippurate 
hydrolase 


Acts on various N- 
benzoylamino acids 




Amidase 






Acylphosphatas 
e 






2- 

hydroxymuconat 

e-semialdehyde 

hydrolase 






Aromatic-L- 

amino-acid 

decarboxylase 


Also acts on L- 
tryptophan, 5- 
hydroxy-L- 
tryptophan and 
dihydroxy- L- 
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phenylalanine 
(DOPA) 




Phenylpyruvate 
decarboxylase 


Also acts on indole- 
s-pyruvate 




carboxymuconol 
actone 

decarboxylase 






o- 

pyrocatechuate 
decarboxvlase 






Phenylalanine 
decarboxvlase 


Also acts on tyrosine 
and other aromatic 
amino acids 




4_ 

hydroxybenzoat 
e decarboxylase 






Protocatechuate 
decarboxylase 






Benzoylformate 

H r^a rh r\ v\/ 1 s c ^ 






4-oxalocrotonate 
decarboxylase 


Involved in the 
meta-cleavage 
pathway for the 
degradation of 

catechols 




4-hydroxy-4- 

methyl-2- 

oxoglutarate 


Also acts on 4- 
hyd roxy-4-methy 1-2- 
oxoadipate and 4- 

c a rh n y v-4 - h vH rn y v- 

vvdl UUAy *T 1 IVUI wyvy 

2-oxohexadioate 




?-OYor>pnt-4- 

enoate 
hydratase 


Al^o arte mnrp 

slowly, on cis-2- 
oxohex-4-enoate, 
but not on the trans- 
isomer 




Phenylalanine 

■ ii \^ ■ i jf i i t 1 1 1 i 

ammonia-lyase 


Mav also act on L- 
tyrosine 




Phenylalanine 
racemase (ATP- 
hydrolysing) 






Mandelate 
racemase 






Phenylpyruvate 


Also acts on other 
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5- 

ca rboxvmeth vl- 
2- 

hydroxymuconat 
e delta- 
isomerase 






Muconolactone 
delta-isomerase 






Muconate 
cycloisomerase 


Also acts, in the 
reverse reaction, on 
3-methyl-cis-cis- 

hovo_ r\ ion o/"l i/"*oto 

and, very slowly, on 
cis-trans- 

hPY£iHi*an^Hinato Mot 

identical with EC 
5.5.1.7 or EC 
5 5 111 




3-carboxy- 

cis,cis-muconate 

cycloisomerase 






Carboxy-cis,cis- 

muconate 

cyclase 






Chloromuconate 
cycloisomerase 


Spontaneous 
elimination of HCI 
produces cis-4- 
carboxymethylenebu 
t-2- en-4-olide Also 

o r^tc in ro\/orco 
dulo 111 Itrvtilot; 

direction on 2- 
chloro-cis,cis- 
munonatp Not 
identical with EC 
5.5.1.1 or EC 
5.5.1.11 




Phenylacetate- 
CoA ligase 


Phenoxyacetate can 

replace 

phenylacetate 




Benzoate--CoA 
ligase 


Also acts on 2-, 3- 
and 4- 

fluorobenzoate, but 
only very slowly on 
the corresponding 
chlorobenzoates 
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4- 

hydroxybenzoat 
e--CoA ligase 






Phenylacetate— 

dnA linpQp 


Also acts, more 

oitjwiy, un cfll/tUcllc;, 

propanoate and 

Kiitannatp hi it nnt 

on hydroxy 
riprivativp^i nf 

phenylacetate and 
related compounds 


Phenylalanine, tyrosine and 
trvDtODhan biosvnthesis 


Quinate 5- 
dehvd ronpna^p 






Shikimate 5- 
dphvd rnnpna^p 






Quinate 
dehydrogenase 
(pyrroloquinoline 
-quinone) 






Phenylalanine 4- 
monooxygenase 






Prephenate 

nonv tr\ yt\ o r\ o o ^ 
UtJI lyUi Uy cl Id be 


This enzyme in the 
enienc Dacxena aiso 
possesses 
chorismate mutase 

aOLIVILy y L.V^ 

5.4.99.5) and 
converts chorismate 
into prephenate 




Prpnhpn^tp 

dehydrogenase 
(NADP+) 






Cyclohexadienyl 
dehydrogenase 


Also acts on 
prephenate and D- 
prephenyllactate (cf. 
EC 1.3.1.12) 




2-methyl- 
branched-chain- 

reductase 


From Ascaris suum 
The reaction 

pivjut;t;uo uiiiy in uifc; 

presence of another 

flavoprotein 

(ETF=[PRIME]Electr 

on-Transferring 

Flavoprotein[PRIME] 

) 




Phenylalanine 


The enzyme from 
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rloh\/rl rr\ nonoco 
vJci lyui uy ci idoc 


DdOlllUb UdUIUo ctNU 

Sporosarcina ureae 

prp hinhlw Qn^r'ifir^ 
die iiiLjiiiy o|^/C7L/i i 

for L-phenylalanine, 
that from Bacillus 
sphaericus also acts 
on L-tyrosine 




L-amino acid 
oxidase 






Anthranilate 

phosphoribosyltr 

ansferase 


In some organisms, 
this enzyme is part 
of a multifunctional 
protein together with 
one or more 
components of the 
system Tor 
biosynthesis of 
tryptophan (EC 
4.1.1.48, EC 
4.1.3.27, EC 

AO A OCi onH FP 

5.3.1.24) 




3- 

phosphoshikimat 
e 1- 

carboxyvinyltran 
sferase 






Aspartate 
aminotransferas 


Also acts on L- 
tyrosine, L- 

pi Icliyldlcll 111 lfc? culU 

L-tryptophan. This 
activity can be 
formed from EC 
2.6.1.57 by 
controlled 
proteolysis 




Tyrosine 

aminotransferas 

e 


L-phenylalanine can 
act instead of L- 
tyrosine The 
mitochondrial 
enzyme may be 
identical with EC 
2.6.1.1 The three 
isoenzymic forms 
are interconverted 
by EC 3.4.22.4 
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Aromatic amino 
acid transferase 


L-methionine can 
also act as donor, 
mnrp ^Inwlv 

1 1 1 \J 1 w 0 1 V V 1 y 

Oxaloacetate can 
act as acceptor 
Controlled 
oroteolvsis convprte 
the enzyme to EC 
2.6.1.1 




Histidinol- 
phosphate 
aminotransferas 
e 






Shikimate kinase 






lndole-3- 
glycerol- 
phosphate 
synthase 


In some organisms, 
this enzyme is part 
of a multifunctional 
protein together with 
one or more 
components of the 

oyolfcMII IUi 

biosynthesis of 
tryptophan (EC 
2.4.2.18, EC 
4 1 3 27 EC 
4.2.1.20, and EC 
5.3.1.24) 




2-dehydro-3- 
deoxyphosphoh 
eptonate 
aldolase 






Anthranilate 
synthase 


In some organisms, 
this enzyme is part 
of a multifunctional 
protein together with 
one or more 
components of the 
system for 
biosynthesis of 
tryptophan (EC 
2.4.2.18, EC 
4.1.1.48, EC 
4.2.1.20, and EC 
5.3.1.24) The native 
enzyme in the 
complex with uses 



632 



PATENT 

Attorney Docket No. 2750-1481P 







either glutamine or 
(less efficiently) 
NHf?n Thp pn7vmp 
separated from the 
complex uses NH(3) 
only 




3- 

dehydroquinate 
dehydratase 






Phosphopyruvat 
e hydratase 


Also acts on 3- 

phospho-D- 

erythronate 




Tryptophan 
synthase 


Also catalyses the 
conversion of serine 
and indole into 
tryptophan and 
water and of 
indoleglycerol 
phosphate into 
indole and 
glyceraldehyde 
phosphate In some 
organisms, this 
enzyme is part of a 
multifunctional 
protein together with 
one or more 
components of the 

biosynthesis of 
tryptophan (EC 
2.4.2.18, EC 
4.1.1.48, EC 
4.1.3.27, and EC 
5.3.1.24) 




r icpi ici idle? 

dehydratase 


ThiQ pn7\/mp in thp 
i i no ci idyllic? in Li ic 

enteric bacteria also 
possesses 
chorismate mutase 
activity and converts 
chorismate into 
prephenate 




Carboxycyclohe 

xadienyl 

dehydratase 


Also acts on 
prephenate and D- 
prephenyllactate Cf. 
EC 4.2.1.51 
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3- 

dehydroquinate 
synthase 


The hydrogen atoms 
on C-7 of the 
substrate are 
retained on C-2 of 
the products 




I IUI IOI 1 Id 

synthase 


Shikimatp 

\Jl IiiaII 1 ICIIC IO 

numbered so that 
the double-bond is 
between C-1 and C- 
2, but some earlier 
papers numbered in 
the reverse direction 




Phosphoribosyla 

nthranilate 

isomerase 


In some organisms, 
this enzyme is part 
of a multifunctional 
protein together with 
one or more 
components of the 

c\/ctoin fnr 
oyolfcJlll IUI 

biosynthesis of 

trvntnnhan (V-C*. 
li yLjiuLji leu i y i — v~/ 

2.4.2.18, EC 
4.1.1.48, EC 
4.1.3.27! and EC 
4.2.1.20) 




Chorismate 
mutase 






Tyrosine-tRNA 
ligase 






Phenylalanine— 
tRNA ligase 




Starch and sucrose 
metabolism 


UDP-glucose 6- 
dehydrogenase 


Also acts on UDP-2- 
deoxyglucose 




Glucoside 3- 


The enzyme acts on 

Lr UlUvUOC, \-J 

galactose, D- 
glucosides and D- 
galactosides, but D- 
glucosides react 
more rapidly than D- 
galactosides 




CDP-4-dehydro- 

6-deoxyglucose 

reductase 


Two proteins are 
involved but no 
partial reaction has 
been observed in 
the presence of 




634 



PATENT 

Attorney Docket No. 2750-1481P 







either alone 




Phosphorylase 


The recommended 
name should be 
qualified in each 
instance by adding 
the name of the 
natural suostance, 
e.g. maltodextrin 
phosphorylase, 

old i L-l 1 

phosphorylase, 

glycogen 

phosphorylase 




Levansucrase 


Some other sugars 
can act as D- 
fructosyl acceptors 




Glycogen 

(starch) 

synthase 


The recommended 
name varies 
according to the 
source of the 
enzyme and the 
nature of its 
synthetic product 
Glycogen synthase 
from animal tissues 
is a complex of a 
catalytic subunit and 
the protein 
glycogenin The 
enzyme requires 
glucosylated 
giycogenin as a 
primer; this is the 
reaction product of 
EC 2.4.1.186 A 
similar enzyme 
utilizes ADP-glucose 

(Cf FP 9 4 1 91^ 




Cellulose 
synthase (UDP- 
forming) 


Involved in the 
synthesis of 
cellulose A similar 
enzyme utilizes 
GDP-glucose (Cf. 
EC 2.4.1.29) 




Sucrose 
synthase 
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Sucrose- 
phosphate 
synthase 






Alpha,alpha- 
trehalose- 
phosphate 
synthase (UDP- 
forming) 


See also EC 
2.4.1.36 




UDP- 

glucuronosyltran 
sferase 


Family of enzymes 
accepting a wide 
range of substrates, 
including phenols, 
alcohols, amines 
and fatty acids 
Some of the 
activities catalysed 
were previously 
listed separately as 
EC 2.4.1.42, EC 
2.4.1.59, EC 
2.4.1.61, EC 

2.4.1.76, EC 

2.4.1.77, EC 
2.4.1.84, EC 

9 4 1 107 onH FP 

2.4.1.108 A 
temporary 

nomenclature for the 
various forms whose 
delineation is in a 
state of flux 




1 ,4-alpha-glucan 

branching 

enzyme 


Converts amylose 
into amylopectin The 
recommended name 
requires a 
qualification 
depending on the 
product, glycogen or 
amylopectin, e.g. 
glycogen branching 
enzyme, 
amylopectin 
branching enzyme. 
The latter has 
frequently been 
termed Q-enzyme 
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Cellobiose 
phosphorylase 






Starch (bacterial 

glycogen) 

synthase 


The recommended 
name various 
according to the 
source of the 
enzyme and the 
nature of its 

oy i iu icuu fjiuuuisi, 

e.g. starch synthase, 
bacterial glycogen 
synthase A similar 
enzyme utilizes 
UDP-glucose (Cf. 
EC 2.4.1.11) 




H dipi Id 

glucanotransfera 
se 


/All Gl IZ.y 1 1 Ills dULIVIly 

of this nature forms 
part of the 
mammalian and 
Yeast glycogen 
branching system 

pCC L V_/ \J . . 1 .00/ 




Cellulose 

<*vntha<;p ffiDP- 
forming) 


Involved in the 

Qvnth^QiQ of 
oyi ili icoio 1^11 

cellulose A similar 
enzyme utilizes 
UDP-alucose (Cf 
EC 2.4.1.12) 




1,3-beta-glucan 
synthase 






Phenol beta- 

alucosvltransfera 

se 


Acts on a wide 
ranae of Dhenols 




Amylosucrase 






Polygalacturonat 
e 4-alpha- 
galacturonosyltr 
ansferase 






Dextransucrase 






Alpha.alpha- 

trehalose 

phosphorylase 






Sucrose 
phosphorylase 


In the forward 
reaction, arsenate 
may replace 
phosphate In the 
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reverse reaction 

V C4 1 IUUO LUO^O Cl 1 1 VJ 

L-arabinose may 
replace D-fructose 




Maltose 
phosphorylase 






1 ,4-beta-D-xylan 
synthase 






Hexokinase 


D-glucose, D- 
mannose, D- 
fructose, sorbitol 
ana u-giucosamine 
can act as acceptors 

ITP onrl H ATP r»on 
1 1 i dilU Un 1 n (Jail 

act as donors The 
liver isoenzyme has 
sometimes been 
called glucokinase 




r 1 lUbpilUy IUL/Ut\l 

nase 






Glucose-1,6- 
bisphosphate 
synthase 


D-glucose 6- 
phosphate can act 
as acceptor, forming 
D-glucose 1,6- 
uibjji lUopridie 




OIUL/UI\li loot? 


A rtrr\t ir\ or»7\ /mac 

m group ot enzymes 
found in 

invertebrates and 
microorganisms 

hinhlv/ or\or k if ir* fr\r 

lll^llly o|JCL/IIILr l\Jl 

glucose 




Fructokinase 






Glucose-1- 
phosphate 
phosphodismuta 
se 






Protein-N(PI)- 

phosphohistidine 

-sugar 

ohosohotransfer 
ase 


Comprises a group 
of related enzymes 
The protein 
substrate is a 
phosphocarrier 
protein of low 
molecular mass (9.5 
Kd) A 

phosphoenzyme 
intermediate is 
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formed The enzyme 
translocates the 
sugar it 

phosphorylates into 
bacteria 

Aldohexoses and 
their glycosides and 
aiuiiois are 
phosphorylated on 
0-6; fructose and 
sorbose on 0-1 

|\ rr\ I omW 
OlyOciOi ailCJ 

disaccharides are 
also substrates 




Glucose-1- 

adenylyltransfer 
ase 






Glucose-1- 

pi lUbpi Idle? 

cytidylyltransfera 
se 






Glucose-1- 

pi lUbpi laic 

ni ismwlv/ltrancffar 
y ucu lyiyiLi d i ioici 

ase 


Also acts, more 
siowiy, on u- 

mannnco "1 _ 

phosphate 




I JTP - 

phosphate 
uridvlvltran^fpra^ 

u i iu y i y i li d i i o i v> i ao 

e 






Pectinesterase 






Trehalose- 
phosphatase 






Sucrose- 
phosphatase 






Glucose-6- 
phosphatase 


Wide distribution in 
animal tissues Also 
catalyses potent 
transphosphorylatio 
ns from ca rbamovl 
phosphate, hexose 
phosphates, 
pyrophosphate, 
phosphoenolpyruvat 
e and nucleoside di- 
and triphosphates, 
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to nil iprioo 

l\J L-/ y lUUUoc, LJ~ 

mannose, 3-methyl- 
D-glucose, or 2- 
deoxy-D-glucose (cf. 
EC 2.7.1.62, EC 
2.7.1.79, and EC 
3.9.1.1) 




Alpha-amylase 


Acts on starch, 
glycogen and 

1 cldlCU 

polysaccharides and 
oligosaccharides in 
a random manner; 
reducing groups are 
liberated in the 
alpha-configuration 




Oligo-1,6- 


Also hydrolyses 

(JdlcUfllUofc; 1 lie 

enzyme from 
intestinal mucosa is 
a single polypeptide 
chain also catalysing 
the reaction of EC 
3.2.1.48 




Maltose- 
6[PRIME]- 
phosphate 
glucosidase 


Hydrolyses a variety 
of 6-phospho-D- 
glucosides, including 
maltose 6- 
phosphate, 
alpha[PRIME]alpha- 
trehalose 6- 
phosphate, sucrose 
6-phosphate and p- 
nitrophenyl-alpha-D- 
glucopyranoside 6- 
phosphate (as a 

phrnmnnpnir 

wl If \Jl 1 IUUGI IILr 

substrate) The 
pn7vmp aptivatpH 

1 i£— y 1 i 1 IO QvLI VCIICU 

by Fe(ll), Mn(ll), 
Co(ll) and Ni(ll). It is 
rapidly inactivated in 
air 




Polygalacturona 
se 






Beta-amylase 


Acts on starch, 
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glycogen ana 
related 

polysaccharides and 
oligosaccharides 
producing beta- 
maltose by an 
inversion 




Alpha- 
glucosidase 


Group of enzymes 
whose specificity is 
directed mainly 
towards the 
exohydrolysis of 1 ,4- 
alpha-glucosidic 
linkages, and that 
hydrolyse 
oligosaccharides 
rapidly, relative to 
polysaccharides, 
which are 

hydrolysed relatively 
slowly, or not at all 
The intestinal 
enzyme also 
hydrolyses 
poiysaccnariaes, 
catalysing the 
reactions of EC 
3.2.1.3, and, more 
slowly, hydrolyses 
1,6-alpha- D- 
glucose links 




Beta- 

glucosidase 


Wide specificity for 
beta-D-glucosides. 
Some examples 
also hydrolyse one 

Ul IflUlt? Ul lllc 

following: beta-D- 
galactosides, alpha- 
L- arabinosides, 
beta-D-xylosides, 
and beta-D- 
fucosides 




Beta- 

fructofuranosida 
se 


Substrates include 
sucrose Also 
catalyses 
fructotransferase 
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reactions 




Alpha, alpha- 
trehalase 




• 


Glucan 1 ,4- 

alpha- 

glucosidase 


Most forms of the 
enzyme can rapidly 
hydrolyse 1,6-alpha- 
D-glucosidic bonds 
when the next bond 
in sequence is 1,4, 
and some 
preparations of this 
enzyme hydrolyse 
1,6- and 1,3-alpha- 
D-glucosidic bonds 
in other 

polysaccharides 
This entry covers all 
such enzymes 
acting on 
nolv^annharidp^ 
more rapidly than on 
oliaosaccharides EC 
3.2.1.20 from 
mammalian intestine 
can catalyse similar 
reactions 




Beta- 

glucuronidase 






Amylo-1,6- 
glucosidase 


In mammals and 
yeast this enzyme is 
linked to a 
a Ivco w 1 1 ra n ^fp ra 

\A IV www V 1 LI C4 1 1 0 1 1 Glw^ 

similar to EC 
2.4.1.25; together 
these two activities 
constitute the 
glycogen 

debranching system 




Xylan 1,4-beta- 
xylosidase 


Also hydrolyses 
xylobiose Some 
other 

exoglycosidase 
activities have been 
found associated 
with this enzyme in 
sheep liver 
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Glucan endo- 
1 3-hpta-n- 
glucosidase 


Very limited action 

on mi YAH-lirtlf ( *1 *\ 
1 1 HXc?U~"lli Il\ ^ 1 T 0 - 

1 ,4-)-beta-D-glucans 
Hydrolyses 
laminarin, 
paramylon and 

narhvirian Diffprpnt 
from EC 3.2.1.6 




Cellulase 


Will also hydrolyse 
1,4-linkages in beta- 
D-glucans also 
containing 1,3- 
linkages 




Sucrose alpha- 
glucosidase 


This enzyme is 
isolated from 
iniesiinai mucosa as 
a single polypeptide 

phain o |qp> Hicrilawinn 
oiidiii dloU Ulopictyliiy 

activity towards 
isomaltose (oligo- 
1,6-glucosidase, cf. 
FC 3 2 1 irn 




C vcloma Itodpytri 
nase 


AIqo hvHrolv/QPQ 

mou i lyui uiyoco 

linear maltodextrin 




Glucan 1,3-beta- 
olucosidasp 


Acts on 

olino^apph^riHp^ hi it 
very slowly on 
laminaribiose 




Levanase 






Galacturan 1 ,4- 
alpha- 

aalacturoniri?3<5p 






Glucan 1 ,4-beta- 
glucosidase 


Acts on 1 ,4-beta-D- 
glucans and related 

olifin^apph^rirlpQ 

Cellobiose is 
hydrolysed, very 
slowly 




Cellulose 1,4- 
beta- 

cellobiosidase 






Alpha,alpha- 

phosphotrehalas 

e 






ADP-sugar 
diphosphatase 


Has a distinct 
specificity from the 
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UDP-sugar 1 
pyrophosphatase 
(EC 3.6.1.45) 




Nucleotide 

pyrophosphatas 

e 


Substrates include 
NAD(+), NADP(+), 
FAD, CoA and also 
ATP and ADP 




UDP- 

glucuronate 
decarboxylase 






CDP-glucose 
4,6-dehydratase 






CDP-abequose 
epimerase 






UDP- 

glucuronate 4- 
epimerase 






Glucose-6- 
phosphate 
isomerase 


Also catalyses the 
anomerization of D- 
glucose 6- 
phosphate 




Phosphoglucom 
utase 


Maximum activity is 
only obtained in the 
presence of alpha- 
D-glucose 1,6- 
bisphosphate. This 
bisphosphate is an 
intermediate in the 
reaction, being 
formed by transfer of 
a phosphate residue 
from the enzyme to 
the substrate, but 
the dissociation of 
bisphosphate from 
the enzyme complex 
is much slower than 
the overall 
isomerization Also, 
more slowly, 
catalyses the 
interconversion of 1- 
phosphate and 6- 
phosphate isomers 
of many other alpha- 
D-hexoses, and the 
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interconversion of 
alpha-D-ribose 1- 
phosphate and 5- 
phosphate 




tseta- 

phosphoglucom 

Uldoc 






Maltose alpha- 
D- 

glucosyltransfera 
se 




Tryptophan metabolism 


lndole-3-lactate 
dehydrogenase 






lndole-3- 
acetaldehyde 
reductase 
(NADH) 






lndole-3- 
acetaldehyde 
reductase 
(NADPH) 






3-hydroxyacyl- 
CoA 

dehydrogenase 


Also oxidizes S-3- 
hydroxyacyl-N- 
acylthioethanolamin 
e and S-3- 
hydroxyacylhydrolip 
oate Some enzymes 
act, more slowly, 
with NADP(+) Broad 
specificity to acyl 
chain-length (cf. EC 

A A A O A A \ 




O-aminophenol 
oxidase 


Isophenoxazine may 
be formed by a 
secondary 
condensation from 
the initial oxidation 
product 




Catalase 


This enzyme can 

dloU dL/L do d 

peroxidase (EC 
1.11.1.7) for which 
several organic 
substances, 
especially ethanol, 
can act as a 
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hydrogen donor A 
manganese protein 
containing Mn(lll) in 
the resting state, 
which also belongs 
here, is often called 
pseudocatalase 
Enzymes from some 
microorganisms, 
such as Penicillium 
simplicissimum, 
which exhibit both 
catalase and 
peroxidase activity, 
have sometimes 
been referred to as 
catalase-peroxidase 




7,8- 

dihydroxykynure 
nate 8,8A- 
dioxygenase 






Tryptophan 2,3- 
dioxygenase 


Broad specificity 
towards tryptamine 
and derivatives 
including D- and L- 
tryptophan, 5- 
hydroxytryptophan 
and serotonin 




Indole 2,3- 
dioxygenase 


The enzyme from 
jasminum is a 
flavoprotein 
containing copper, 
and forms 
anthranilate as the 
final product One 
enzyme from 
Tecoma stans is 
also a flavoprotein 
containing copper 
diiu uses xnree 
atoms of oxygen per 
molecule of indole, 
to form anthranil 
(3,4-benzisoxazole) 
A second enzyme 
from Tecoma stans, 
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which is not a 
flavoprotein, uses 
tout aioms ot 
oxygen and forms 
anthranilate as the 
final product 




2,3- 

dihydroxyindole 
2,3-dioxygenase 






Indoleamine- 
pyrrole z,o- 
dioxygenase 


Acts on many 
substituted and 
unsubstituted 
indoleamines, 
including melatonin 
Involved in the 
degradation of 
melatonin 




o- 

hydroxyanthranil 
ate 3,4- 
dioxygenase 


The product of the 
reaction 
spontaneously 
rearrange to 
quinolinic acid (quin) 




Tryptophan 2- 
monooxygenase 






Tryptophan 

2[PRIME]- 

dioxygenase 


Acts on a number of 
indolyl-3-alkane 
derivatives, oxidizing 
the 3-side-chain in 
the2[PRIME]- 
position, best 
substrates are L- 
tryptophan and 5- 
hydroxy-L- 
tryptophan 




Kynurenine 3- 
monooxygenase 






Unspecific 
monooxygenase 


Acts on a wide 
range of substrates 
including many 

Xcl lUDIUUUb, 

steroids, fatty acids, 
vitamins and 
prostaglandins 
Reactions catalysed 
include 
hydroxylation, 
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epoxidation, N- 
oxidation, 
sulfooxidation, N-, 
S- and O- 
dealkylations, 
destination, 
deamination, and 
reduction of azo, 
nitro, and N-oxide 
groups 




Antnranilate 3- 
monooxygenase 






Tryptophan 5- 
monooxygenase 


Activated by 
phosphorylation, 
catalysed by a 
CA(2+)-activated 
protein kinase 




Kynurenine 7,8- 
hydroxylase 






Aldehyde 

dehydrogenase 

(NAD+) 


Wide specificity, 
including oxidation 
of D- 

glucuronolactone to 
D-glucarate 




Aminomuconate- 

semialdehyde 

dehydrogenase 


Also acts on 2- 
hydroxy muconate 
semialdehyde 




Aldehyde 
oxidase 


Also oxidizes 
quinoline and 
pyridine derivatives 
May be identical 
with EC 1.1.3.22 




lndole-3- 

acetaldehyde 

oxidase 


Also oxidizes indole- 
3-aldehyde and 
acetaldehyde, more 
slowly 




Oxoglutarate 

dehydrogenase 

(lipoamide) 


Component of the 
multienzyme 2- 
oxoglutarate 

H^h\/H rnnonaco 
uci lyui uy d idoc 

complex 




Kynurenate-7,8- 

dihydrodiol 

dehydrogenase 






Glutaryl-CoA 
dehydrogenase 
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L-amino acid 
oxidase 






Amine oxidase 
(flavin- 
containing) 


Acts on primary 
amines, and usually 
also on secondary 
and tertiary amines 




Amine oxidase 
(copper- 
containing) 


A group of enzymes 
including those 
oxidizing primary 
amines, diamines 
and histamine One 
form of EC 1.3.1.15 
from rat kidney also 
catalyses this 
reaction 




Acetylindoxyl 
oxidase 






Acetylserotonin 

U- 

methyltransferas 
e 


Some other 
hydroxyindoles also 
act as acceptor, 
more slowly 




Indole-3- 
pyruvate C- 
methyltransferas 
e 






Amine N- 

methyltransferas 

e 


A wide range of 
primary, secondary, 
and tertiary amines 
can act as 
acceptors, including 
tryptamine, aniline, 
nicotine and a 
variety of drugs and 
other xenobiotics 




Aralkylamine N- 
acetyltransferas 
e 


Narrow specificity 
towards 
aralkylamines, 
including serotonin 
Not identical with EC 
2.3.1.5 




Acetyl-CoA C- 

acetyltransferas 

e 






Tryptophan 

aminotransferas 

e 


Also acts on 5- 
hydroxytryptophan 
and, to a lesser 
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extent on the phenyl 
amino acids 




Kynurenine— 
oxogiutaraie 
aminotransferas 
e 


Also acts on 3- 
hydroxykynurenine 




Thioglucosidase 


Has a wide 
specificity for 
thioglycosides 




Amidase 






Formamidase 


Also acts, more 
slowly, on 
acetamide, 
propanamide and 
butanamide 




Arylformamidase 


Also acts on other 

aromatic 

formylamines 




Nitrilase 


Acts on a wide 
range of aromatic 
nitriles including 
(indole-3- yl)- 
acetonitrile and also 
on some aliphatic 
nitriles, and on the 
corresponding acid 
amides (cf. EC 
4.2.1.84) 




Kynureninase 


Also acts on 

3[PRIME]- 

hydroxykynurenine 

and some other (3- 

arylcarbonyl)- 

alanines 




Aromatic-L- 

amino-acid 

decarboxylase 


Also acts on L- 
tryptophan, 5- 
hydroxy-L- 
tryptophan and 
di hydroxy- L- 

nhonv/lQlsnino 
yj\ ici lyidicn ill ic2 

(DOPA) 




Phenylpyruvate 
decarboxylase 


Also acts on indole- 
s-pyruvate 




Aminocarboxym 

uconate- 

semialdehyde 


The product 
rearranges non- 
enzymically to 
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decarboxylase 


picolinate 




Tryptophanase 


Also catalyses the 
synthesis of 
tryptophan from 
indole and serine 
Also catalyses 2,3- 
elimination and 
beta-replacement 

J." -T 

reactions of some 
indole-substituted 
tryptophan analogs 
of L-cysteine, L- 
serine and other 3- 
substituted amino 
acids 




Enoyl-CoA 
hydratase 


Acts in the reverse 
direction With cis- 
compounds, yields 
(3R)-3-hydroxyacyl- 
CoA (cf. EC 
4.2.1.74) 




Nitrile hydratase 


Acts on short-chain 
aliphatic nitriles, 
converting them into 
the corresponding 
acid amides Does 
not act on these 
amides or on 
aromatic nitriles (cf 
EC 3.5.5.1) 




Tryptophan- 
tRNA ligase 




Tyrosine metabolism 


A 1 _ | _ 1 

Alcohol 

dehydrogenase 


Acts on primary or 
secondary alcohols 
or hemiacetals The 
animal, but not the 
yeast, enzyme acts 
also on cyclic 
secondary alcohols 




hydroxyphenylla 
ctate 

dehydrogenase 


MloU dULb, lllUltr 

slowly, on (R)-3- 
phenyllactate, (R)-3- 
(indole-3- yl)lactate 
and (R)-lactate 




Hydroxyphenylp 
yruvate 


Also acts on 3-(3,4- 
dihydroxyphenyl)lact 
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reductase 


ate Involved with EC 
2.3.1.140 in the 
biosynthesis of 
rosmarinic acid 




Aryl-alcohol 
dehydrogenase 


A group of enzymes 
with broad specificity 
towards primary 
alcohols with an 
aromatic or 
cyclohex-1-ene ring, 
but with low or no 
activity towards 
short-chain aliphatic 
alcohols 




Catechol 
oxidase 


Also acts on a 
variety of substituted 
catechols Many of 
these enzymes also 
catalyse the reaction 
listed under EC 
1.14.18.1; this is 
especially true for 
the classical 
tyrosinase 




Iodide 
peroxidase 






3.4- 

dihydroxyphenyl 
acetate 2,3- 
dioxygenase 






4- 

hydroxyphenylpy 
ruvate 

dioxygenase 






Stizolobate 
synthase 


The intermediate 
product undergoes 
ring closure and 
oxidation, with 
NAD(P)(+) as 

stizolobic acid 




Stizolobinate 
synthase 


The intermediate 
product undergoes 
ring closure and 
oxidation, with 
NAD(P)(+) as 
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acceptor, to 
stizolobinic acid 




Gentisate 1,2- 
dioxygenase 






Homogentisate 
1,2-dioxygenase 






4- 

hydroxyphenylac 
etate 1- 

monooxygenase 


Also acts on 4- 
hydroxyhydratropate 
forming 2- 
methylhomogentisat 
e and on 4- 
hydroxyphenoxyacet 
ate forming 
hydroquinone and 
glycolate 




4- 

hydroxyphenylac 
etate 3- 

monooxygenase 






Tyrosine N- 
monooxygenase 






■ 1 ■ i i 

Hydroxyphenyla 

cetonitrile 2- 
monooxygenase 






Tyrosine 3- 
monooxygenase 


Activated by 
phosphorylation, 
catalysed by EC 
2.7.1.128 




Dopamine-beta- 
monooxygenase 


Stimulated by 
fumarate 




Monophenol 
monooxygenase 


A group of copper 
proteins that also 
catalyse the reaction 
of EC 1.10.3.1, if 
only 1,2- 
benzenediols are 
available as 
substrate 




Succinate- 

dehydrogenase 
(NAD(P)+) 






Aryl-aldehyde 
dehydrogenase 


Oxidizes a number 
of aromatic 
aldehydes, but not 
aliphatic aldehydes 
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Aldehyde 

dehydrogenase 

(NAD+) 


Wide specificity, 
including oxidation 
of D- 

glucuronolactone to 
D-glucarate 




4-carboxy-2- 
nyu roxy rnuconai 
e-6- 

semialdehyde 

Ucl lyUI uy ci Idot; 


Does not act on 
unsuiJSiiiuieQ 
aliphatic or aromatic 
aldehydes or 

n It ir^Pico NIAH/ A-\ ran 

replace NADP(+), 
but with lower 
affinity 




Aldehyde 
aenya rogenase 
(NAD(P)+) 






4- 

hydroxyphenylac 
etaldehyde 

uenya rogenase 


With EC 4.2.1.87, 
brings about the 
metabolism of 
octopamine in 
Pseudomonas 




Miuenyue 
oxidase 


MlbO OXIUIZc;o 

quinoline and 
pyridine derivatives 
May be identical 

with FP 1 1 199 

Willi 1. 1 




L-amino acid 
oxidase 






Amine oxidase 
(flavin- 
containing) 


Acts on primary 
amines, and usually 
also on secondary 
and tertiary amines 




Amine oxidase 

(copper- 

containing^ 


A group of enzymes 
including those 
oxiuizing primary 
amines, diamines 
and histamine One 
form of EC 1.3.1.15 
from rat kidney also 
catalvses this 
reaction 




Aralkylamine 
dehydrogenase 


Phenazine 
methosulfate can act 
as acceptor Acts on 
aromatic amines 
and, more slowly, on 
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some long-chain 
aliphatic amines, but 
not on metnyiamme 
or ethylamine (cf EC 
1.4.99.3) 




rrienoi U- 

methyltransferas 

e 


Acts on a wide 
variety of simple 
alkyh methoxy- and 
halo-phenols 




Tyramine N- 

methyltransferas 

e 


Has some activity on 

phenylethylamine 

analogs 




Phenylethanola 
mine N- 

methyltransferas 
e 


Acts on various 
phenylethanolamine 
s; converts 
noradrenalin into 
adrenalin 




Catechol O- 

methyltransferas 

e 


The mammalian 
enzymes act more 
rapidly on 
catecholamines 
such as adrenaline 
or noradrenaline 
than on catechols 




Glutamine N- 

phenylacetyltran 

sferase 






Rosmarinate 
synthase 


Involved with EC 

AAA 007 : H iL 

1 .1 .1 .237 in the 
biosynthesis of 
rosmarinic acid 




Hydroxymandelo 
nitrile 

glucosyltransfera 
se 


3,4- 

dihydroxymandelonit 
rile can also act as 
acceptor 




Aspartate 

aminotransferas 

e 


Also acts on L- 
tyrosine, L- 
phenylalanine and 
L-tryptophan. This 

or^ix/itx/ fan ho 
caUUVIiy Udll uc 

formed from EC 
2.6.1.57 by 
controlled 
proteolysis 




Dihydroxyphenyl 
alanine 
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aminotransferas 
e 






Tyrosine 

aminotransferas 

e 


L-phenylalanine can 
act instead of L- 
tyrosine The 
mitochondrial 
enzyme may be 
identical with EC 
2.6.1.1 The three 
isoenzymic forms 
are interconverted 
by EC 3.4.22.4 




Aromatic amino 
acid transferase 


L-methionine can 
also act as donor, 
more slowly 
Oxaloacetate can 
act as acceptor 
Controlled 
proteolysis converts 
the enzyme to EC 
2.6.1.1 




Histidinol- 
phosphate 
aminotransferas 
e 






Fumarylacetoac 
etase 


Also acts on other 
3,5- and 2,4-dioxo 
acids 




Acylpyruvate 
hydrolase 


Acts on 

formylpyruvate, 2,4- 
dioxopentanoate, 
2,4-dioxohexanoate 
and 2,4- 

dioxoheptanoate 




Tyrosine 
decarboxylase 


The bacterial 
enzyme also acts on 
3-hydroxytyrosine 
and, more slowly, on 
3- 

f lyuroxypnenyiaianin 
e 




Aromatic-L- 

amino-acid 

decarboxylase 


Also acts on L- 
tryptophan, 5- 
hydroxy-L- 
tryptophan and 
dihydroxy- L- 
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phenylalanine 

(UUrA) 




Gentisate 
decarboxylase 






5-oxopent-3- 
ene-1,2,5- 
tricarboxylate 
decarboxylase 






Tyrosine phenol- 
lyase 


Also slowly 
catalyses pyruvate 
formation from D- 
tyrosine, S-methyl- 
L-cysteine, L- 
cysteine, L-serine 
and D-serine 




(S)- 

norcoclaurine 
synthase 


The reaction makes 
a 6-membered ring 
by forming a bond 
between C-6 of the 
3,4-dihydroxyphenyl 
group of the 
dopamine and C-1 
of the aldehyde in 
the imine formed 
between the 
substrates The 
product is the 
precursor of the 
benzylisoquinoline 
alkaloids in plants 
Will also catalyse 
the reaction of 4-(2- 
aminoethyl)benzene 
-1,2-diol + (3,4- 
dihydroxyphenyl)ace 
taldehyde to form 
(S)- 

norlaudanosoline, 
but this alkaloid has 

not hp^n fminH to 

occur in plants 




Dihydroxyphenyl 
alanine 

ammonia-lyase 






Phenylalanine 
ammonia-lyase 


May also act on L- 
tyrosine 
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Maleylacetoacet 
ate isomerase 


Also acts on 
maleylpyruvate 




Maleylpyruvate 
isomerase 






Phenylpyruvate 
tautomerase 


Also acts on other 
arylpyruvates 




5- 

carboxymethyl- 
2- 

hydroxymuconat 
e delta- 
isomerase 






Tyrosine 2,3- 
aminomutase 






Phenylacetate-- 


Also acts, more 
siowiy, on acetate, 
propanoate and 
butanoate, but not 
on hydroxy 
derivatives of 
phenylacetate and 
related compounds 
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IX. PROMOTERS AS SENTINELS 

Useful promoters include those that are capable of facilitating preferential transcription, i.e. 
tissue-specific or developmentally regulated gene expression and being a component of facile 
systems to evaluate the metabolic/physiological state of a plant cell, tissue or organ. Many such 
promoters are included in this application. Operably linking a sequence to these promoters that can 
act as a reporter and inserting the construct into a plant allows detection of the preferential in plantar 
transcription. For example, the quantitative state of responses to environmental conditions can be 
detected by using a plant having a construct that contains a stress-inducible promoter linked to and 
controlling expression of a sequence encoding GFP. The greater the stress promoter is induced, the 
greater the levels of fluorescence from GFP will be produced and this provides a measure of the 
level of stress being expressed by the plant and/or the ability of the plant to respond internally to the 
stress. 

More specifically, using this system the activities of any metabolic pathway (catabolic and 
anabolic), stress-related pathways as on any plant gene repeated activity can be monitored. In 
addition, assays can be developed using this sentinel system to select for superior genotypes with 
greater yield characteristics or to select for plants with altered responses to chemical, herbicide, or 
plant growth regulators or to identify chemical, herbicides or plant growth regulators by their 
response on such sentinels. 

Specifically, a promoter that is regulated in plants in the desired way, is operably linked to a 
reporter such as GFP, RFP, etc., and the constructs are introduced into the plant of interest. The 
behavior of the reporter is monitored using technologies typically specific for that reporter. With 
GFP, RFP, etc., it could typically be by microscopy of whole plants, organs, tissues or cells under 
excitation by an appropriate wavelength of UV light. 
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BRIEF DESCRIPTION OF THE TABLES 



BRIEF DESCRIPTION OF THE TABLES 



O 



5 1 . Reference and Sequence Tables 

The sequences of exemplary SDFs and polypeptides corresponding to the coding sequences 
of the instant invention are described in the Reference and Sequence Tables (sometimes referred 
to as the REF and SEQ Tables. The Reference Table refers to a number of "Maximum Length 
10 Sequences" or "MLS." Each MLS corresponds to the longest cDNA obtained, either by cloning 
or by the prediction from genomic sequence. The sequence of the MLS is the cDNA sequence as 
described in the Av subsection of the Reference Table, 
ft The Reference Table includes the following information relating to each MLS: 

■ ft L cDNA Sequence 

J5 A. 5' UTR 

US B. Coding Sequence 

m C. 3' UTR 

Sfj II. Genomic Sequence 

:^ A. Exons 

40 B. Introns 

C. Promoters 

III. Link of cDNA Sequences to Clone IDs 

IV. Multiple Transcription Start Sites 
: ,p V. Polypeptide Sequences 

d 5 A. Signal Peptide 

B. Domains 

C. Related Polypeptides 

VI. Related Polynucleotide Sequences 

30 I. cDNA SEQUENCE 

The Reference Table indicates which sequence in the Sequence Table represents the 
sequence of each MLS. The MLS sequence can comprise 5' and 3' UTR as well as coding 
sequences. In addition, specific cDNA clone numbers also are included in the Reference Table 
when the MLS sequence relates to a specific cDNA clone. 
35 A. 5' UTR 

The location of the 5' UTR can be determined by comparing the most 5' MLS sequence 
with the corresponding genomic sequence as indicated in the Reference Table. The sequence 
that matches, beginning at any of the transcriptional start sites and ending at the last nucleotide 
before any of the translational start sites corresponds to the 5' UTR. 
40 B. Coding Region 

The coding region is the sequence in any open reading frame found in the MLS. Coding 
regions of interest are indicated in the PolyP SEQ subsection of the Reference Table. 
C. 3' UTR 
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The location of the 3' UTR can be determined by comparing the most 3' MLS sequence 
with the corresponding genomic sequence as indicated in the Reference Table. The sequence 
that matches, beginning at the translational stop site and ending at the last nucleotide of the MLS 
corresponds to the 3 ' UTR. 

II. GENOMIC SEQUENCE 

Further, the Reference Table indicates the specific "gi" number of the genomic sequence 
if the sequence resides in a public databank. For each genomic sequence, Reference tables 
indicate which regions are included in the MLS. These regions can include the 5' and 3' UTRs 
as well as the coding sequence of the MLS. See, for example, the scheme below: 



Region 1 



Region 2 



Region 3 



•j 5' UTR j Exon |- 



I Exon | - 



■ | Exon | 3 ' UTR | ■ 



Promoter | Intron 

Translational 
Start Site 



I 

Intron 



Stop Codon 



The Reference Table reports the first and last base of each region that are included 
in an MLS sequence. An example is shown below: 
gi No. 47000: 
37102 ... 37497 
37593 ... 37925 

The numbers indicate that the MLS contains the following sequences from two regions of 
gi No. 47000; a first region including bases 37102-37497, and a second region including bases 
37593-37925. 



A. EXON SEQUENCES 
The location of the exons can be determined by comparing the sequence of the 
regions from the genomic sequences with the corresponding MLS sequence as indicated by the 
Reference Table. 



i. INITIAL EXON 
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To determine the location of the initial exon, information from the 

(1) polypeptide sequence section; 

(2) cDNA polynucleotide section; and 

(3) the genomic sequence section 

of the Reference Table is used. First, the polypeptide section will indicate where the 
translational start site is located in the MLS sequence. The MLS sequence can be matched to the 
genomic sequence that corresponds to the MLS. Based on the match between the MLS and 
corresponding genomic sequences , the location of the translational start site can be determined 
in one of the regions of the genomic sequence. The location of this translational start site is the 
start of the first exon. . 

Generally, the last base of the exon of the corresponding genomic region, in which the 
translational start site was located, will represent the end of the initial exon. In some cases, the 
initial exon will end with a stop codon, when the initial exon is the only exon. 

In the case when sequences representing the MLS are in the positive strand of the 
corresponding genomic sequence, the last base will be a larger number than the first base. When 
the sequences representing the MLS are in the negative strand of the corresponding genomic 
sequence, then the last base will be a smaller number than the first base. 

ii. INTERNAL EXONS 

Except for the regions that comprise the 5' and 3' UTRs, initial exon, and terminal exon, 
the remaining genomic regions that match the MLS sequence are the internal exons. 
Specifically, the bases defining the boundaries of the remaining regions also define the 
intron/exon junctions of the internal exons. 

iii. TERMINAL EXON 

As with the initial exon, the location of the terminal exon is determined with information 
from the 

(1) polypeptide sequence section; 

(2) cDNA polynucleotide section; and 

(3) the genomic sequence section 
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of the Reference Table. The polypeptide section will indicate where the stop codon is 
located in the MLS sequence. The MLS sequence can be matched to the corresponding genomic 
sequence. Based on the match between MLS and corresponding genomic sequences, the 
location of the stop codon can be determined in one of the regions of the genomic sequence. The 
5 location of this stop codon is the end of the terminal exon. Generally, the first base of the exon of 
the corresponding genomic region that matches the cDNA sequence, in which the stop codon 
was located, will represent the beginning of the terminal exon. In some cases, the translational 
start site will represent the start of the terminal exon, which will be the only exon. 

In the case when the MLS sequences are in the positive strand of the 
H) corresponding genomic sequence, the last base will be a larger number than the first base. When 
p the MLS sequences are in the negative strand of the corresponding genomic sequence, then the 
last base will be a smaller number than the first base. 



B. INTRON SEQUENCES 
In addition, the introns corresponding to the MLS are defined by identifying the genomic 
Q sequence located between the regions where the genomic sequence comprises exons. Thus, 
ipy introns are defined as starting one base downstream of a genomic region comprising an exon, 
"f: and end one base upstream from a genomic region comprising an exon. 

20 C. PROMOTER SEQUENCES 

As indicated below, promoter sequences corresponding to the MLS are defined as 
sequences upstream of the first exon; more usually, as sequences upstream of the first of multiple 
transcription start sites; even more usually as sequences about 2,000 nucleotides upstream of the 
first of multiple transcription start sites. 

25 

III. LINK of cDNA SEQUENCES to CLONE IDs 

As noted above, the Reference Table identifies the cDNA clone(s) that relate to each 
MLS. The MLS sequence can be longer than the sequences included in the cDNA clones. In 
such a case, the Reference Table indicates the region of the MLS that is included in the clone. If 
30 either the 5' or 3' termini of the cDNA clone sequence is the same as the MLS sequence, no 
mention will be made. 
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IV. Multiple Transcription Start Sites 

Initiation of transcription can occur at a number of sites of the gene. The 
Reference Table indicates the possible multiple transcription sites for each gene. In the 
Reference Table, the location of the transcription start sites can be either a positive or negative 
number. 

The positions indicated by positive numbers refer to the transcription start sites as located 
in the MLS sequence. The negative numbers indicate the transcription start site within the 
genomic sequence that corresponds to the MLS. 

To determine the location of the transcription start sites with the negative 
numbers, the MLS sequence is aligned with the corresponding genomic sequence. In the 
instances when a public genomic sequence is referenced, the relevant corresponding genomic 
sequence can be found by direct reference to the nucleotide sequence indicated by the "gi" 
number shown in the public genomic DNA section of the Reference Table. When the position is 
a negative number, the transcription start site is located in the corresponding genomic sequence 
upstream of the base that matches the beginning of the MLS sequence in the alignment. The 
negative number is relative to the first base of the MLS sequence which matches the genomic 
sequence corresponding to the relevant "gi" number. 

In the instances when no public genomic DNA is referenced, the relevant 
nucleotide sequence for alignment is the nucleotide sequence associated with the amino acid 
sequence designated by "gi" number of the later PolyP SEQ subsection. 

V. Polypeptide Sequences 

The PolyP SEQ subsection lists SEQ ID NOs and Ceres SEQ ID NO for polypeptide 
sequences corresponding to the coding sequence of the MLS sequence and the location of the 
translational start site with the coding sequence of the MLS sequence. 

The MLS sequence can have multiple translational start sites and can be capable of 
producing more than one polypeptide sequence. 

A. Signal Peptide 
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The Reference tables also indicate in subsection (B) the cleavage site of the putative 
signal peptide of the polypeptide corresponding to the coding sequence of the MLS sequence. 
Typically, signal peptide coding sequences comprise a sequence encoding the first residue of the 
polypeptide to the cleavage site residue. 
B. Domains 

Subsection (C) provides information regarding identified domains (where present) within 
the polypeptide and (where present) a name for the polypeptide domain. 

C. Related Polypeptides 

Subsection (Dp) provides (where present) information concerning amino acid sequences 
that are found to be related and have some percentage of sequence identity to the polypeptide 
sequences of the Reference and Sequence Tables. These related sequences are identified by a 
"gi" number. 

VI. Related Polynucleotide Sequences 



Subsection (Dn) provides polynucleotide sequences (where present) that are related to 
and have some percentage of sequence identity to the MLS or corresponding genomic sequence. 



Abbreviation 


Description 


Max Len. Seq. 


Maximum Length Sequence 


rel to 


Related to 


Clone Ids 


Clone ID numbers 


Pub gDNA 


Public Genomic DNA 


gi No. 


gi number 


Gen. Seq. in Cdna 


Genomic Sequence in cDNA 

(Each region for a single gene prediction is 

listed on a separate line. 

In the case of multiple gene predictions, the 

group of regions relating to a single prediction 

are separated by a blank line) 


(Ac) cDNA SEQ 


cDNA sequence 
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Abbreviation 


Description 


- Pat. Appln. SEQ ID NO 


Patent Application SEQ ID NO: 


- Ceres SEQ ID NO: 1673877 


Ceres SEQ ID NO: 


- SEQ # w. TSS 


Location within the cDNA sequence, SEQ ID 
NO:, of Transcription Start Sites which are 
listed below 


- Clone ID #:#-># 


Clone ID comprises bases # to # of the cDNA 
Sequence 


PolyP SEQ 


Polypeptide Sequence 


- Pat. Appln. SEQ ID NO: 


Patent Application SEQ ID NO: 


- Ceres SEQ ID NO 


Ceres SEQ ID NO: 


- Loc. SEQ ID NO: @ nt. 


Location of translational start site in cDNA of 
SEQ ID NO: at nucleotide number 


(C) Pred. PP Nom. & Annot. 


Nomination and Annotation of Domains within 
Predicted Polypeptide(s) 


- (Title) 


Name of Domain 


- Loc. SEQ ID NO #:#-># aa. 


Location of the domain within the polypeptide 
of SEQ ID NO: from # to # amino acid 
residues. 


(Dp) Rel. AA SEQ 


Related Amino Acid Sequences 


- Align. NO 


Alignment number 


- gi No 


Gi number 


- Desp. 


Description 


- % Idnt. 


Percent identity 


- Align. Len. 


Alignment Length 


- Loc. SEQ ID NO: # -> # aa 


Location within SEQ ID NO: from # to # 
amino acid residue. 



2. Protein Group Table 
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This table indicates groups of proteins that share a signature sequence (also 
referred to as a consensus sequence). The Protein group also referred to as the Ortholog 
group is named by the peptide ID with which all members were compared. Each group 
contains sequences that were included at the 10' 50 5 10" 30 , and 10" 10 p-value cutoffs. For 
each group, the peptide ID and at which cutoff the peptide was included into the group. 
The same peptide ID may be included in the group three times as peptide ID 50, peptide 
ID 30 and peptide ID 10. The data indicates that peptide ID was included in the group 
when the threshold was either 10~ 50 , 10~ 30 , or 10' [ . All the peptide IDs that are followed 
by "50" were included in the protein group when the e-value cutoff was 10' 50 . All the 
peptide IDs that are followed by either "30" or "50" were included in the protein group 
when the threshold e-value was 10" 30 . All the peptide IDs that are followed by "10", 
"30" or "50" were included in the protein group when 10" 10 was used as the e-value 
cutoff. 

At the end of each protein group is a list of the consensus sequence that proteins share at the 10" 50 , 
10~ 30 , or 10" 10 . The consensus sequence contains both lower-case and upper-case letters. The upper- 
case letters represent the standard one-letter amino acid abbreviations. The lower case letters 
represent classes of amino acids: 

• "t" refers to tiny amino acids, which are specifically alanine, glycine, serine and threonine. 

• "p M refers to polar amino acids, which are specifically, asparagine and glutamine 

• "n" refers to negatively charged amino acids, which are specifically, aspartic acid and 
glutamic acid 

• "+" refers to positively charged residues, which are specifically, lysine, arginine, and 
histidine 

• "r" refers to aromatic residues, which are specifically, phenylalanine, tyrosine, and 
tryptophan, 

• "a" refers to aliphatic residues, which are specifically, isoleucine, valine, leucine, and 
methonine 



3. Protein Group Matrix Table 

In addition to each consensus sequence, Applicants have generated a scoring matrix to 
provide further description of the consensus sequence. The first row of each matrix indicates the 
residue position in the consensus sequence. The matrix reports number of occurrences of all the 
amino acids that were found in the group members for every residue position of the signature 
sequence. The matrix also indicates for each residue position, how many different organisms 
were found to have a polypeptide in the group that included a residue at the relevant position. 
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The last line of the matrix indicates all the amino acids that were found at each position of the 
consensus. 

4. MA diff Table 

The MA_diff Table presents the results of the differential expression experiments for the 
mRNAs, as reported by their corresponding cDNA ID number, that were differentially 
transcribed under a particular set of conditions as compared to a control sample. The cDNA ID 
numbers correspond to those utilized in the Reference and Sequence Tables. Increases in mRNA 
abundance levels in experimental plants versus the controls are denoted with the plus sign (+). 
Likewise, reductions in mRNA abundance levels in the experimental plants are denoted with the 
minus (-) sign. 

The Table is organized according to each set of experimental conditions, which are 
denoted by the term "Expt ID:" followed by a particular number. The table below links each 
Expt ID with a short description of the experiment and the parameters. 

For each experiment ID a method of the normalization is specified. "Method: 2" 
represents normalization by median the goal of the method is to adjust the ratios by a factor so 
that the median of the ratio distribution is 1 . Method 3 is the normalization procedure conducted 
by Aglilent Technologies, Inc. Palo Alto, California, USA. 

The MA diff Table also specifies the specific parameters and the experiment number 
(e.g. 107871) used in compiling the data. The experiment numbers are referenced in the 
appropriate utility/functions sections herein. The background threshold was set to 
"BKG_Threshold=X" to reduce the effect of the background on the signal. 

Finally, the Table includes reference to an "Organism_ID" number. This number refers to 
the cDNA spotted on the chip were similar to Arabidopsis thaliana (3769) sequences or whether 
the oligo used for the chips were similar to Zea mays (3 1 1987)sequences. 



5. MA diff (Experiment) Tables 



Example No. 


Experiment 


genome 


Expt ID 


Value 


Parameter 


Units 




short name 
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3ii 


3642-1 


Arabidopsis 


108512 


3746-1 


Plant Line 


Hours 


3n 


Arab 0.001% 
MeJAl 


Arabidopsis 


108568 


Aerial 


Tissue 


Tissue 










0.001% Me J 
A 


Treatment 


Compound 










1 


Timepoint 


Hours 


3n 


Arab 0.001% 
MeJA_l 


Arabidopsis 


108569 


Aerial 


Tissue 


Tissue 










6 


Timepoint 


Hours 










0.001% MeJ 
A 


Treatment 

X 1 vtiviiivili 


romnnutid 


3j 


Arab_0.1uM_E 
pi-Brass_l 


Arabidopsis 


108580 


Aerial 


Tissue 


Tissue 










1 


Timepoint 


Hours 










0.1 uM Brassi 
nonsteroid 


Treatment 

X 1 vtillllVlll 


romnoiind 

Vlll YJ \J U1IU. 


3j 


Arab_0.1nM E 
pi-Brass 1 


Arabidopsis 


108581 

X \J \j \J X 


Aerial 


HTi s<iiip 


Tl OC1 IP* 










6 


Timepoint 


Hours 










0.1uM_Brassi 
no Steroid 


Treatment 


Compound 


3g 


Arab lOOuM 
ABAl 


Arabidopsis 


108560 


Aerial 


Tissue 


Tissue 










1 


Timepoint 


Hours 










lOOuM ABA 


Treatment 


Comnound 


3g 


Arab lOOuM 
ABAl 


Arabidopsis 


108561 


Aerial 


Tissue 


Tissue 










lOOuM ABA 


Treatment 


Compound 










6 


Timepoint 


Hours 


31 


Arab lOOuM 
BA_1 


Arabidopsis 


108566 


Aerial 


Tissue 


Tissue 










1 


Timepoint 


Hours 










100uM_BA 


Treatment 


Compound 


31 


Arab lOOuM 
BA_1 


Arabidopsis 


108567 


Aerial 


Tissue 


Tissue 










100uM_BA 


Treatment 


Compound 










6 


Timepoint 


Hours 


3k 


Arab lOOuM 
GA3_1 


Arabidopsis 


108562 


Aerial 


Tissue 


Tissue 










1 


Timepoint 


Hours 










lOOuM GA3 


Treatment 


Compound 


3k 


Arab lOOuM 
GA3_1 


Arabidopsis 


108563 


Aerial 


Tissue 


Tissue 










lOOuM GA3 


Treatment 


Compound 










6 


Timepoint 


Hours 



669 



PATENT 

Attorney Docket No. 2750-1481P 



3h 


Arab lOOuM 
NAA~ 1 


A rabi d onsi s 


108564 


A priq] 
/Vt/l idl 


1 1 CCl If* 


Tl CCUP 










1 


Timepoint 


Hours 










100uM NAA 

1 \J\J U1V1 1 >( 


Trfatm pnt 

1 ledlllieill 


Pnmnni inH 


3h 


Arab lOOuM 
NAA~ 1 


Arabidonsi s 


108565 


Aprial 

xlvl 1C11 


Tl QRllP 
X loo Lit/ 


Ti cciip 










100uM_NAA 


Treatment 


Compound 










6 


Tl m pnni nt 
x inxeLFVJiiii 


nuui o 


3r 


Arab 20% PE 
G_l ~ 


Arabidonsi^ 


108570 


A prial 


Tl CCl lf=» 


Tl CCl 1 P> 










1 


Timepoint 


Hours 












Tt* £1 TTTl f=* t 
1 ItdHllClll 


( /^ttT T^r^i inn 
V_^VJlll|JiJ U11U 


3r 


Arab 20% PE 

G 1 ~ 


A ra b i H nrw i c 


1 08571 


Idl 


Tl CC11£* 

1 lobUe 


Tl CC11P 

1 issue 










20%PEG 


Treatment 


Compound 












i linepoini 


Hours 


3o 


Arab 2mM SA 

.TlX ClU ^«1111VX 
1 


ArabiHrmcic 

ill CiU'lvXVJL/OlO 


108586 


A ^t*i o 1 

/VCl Idl 


T"l CCl If* 


1 issue 










2mM_SA 


Treatment 


Compound 










i 
i 


i lmepoini 


Hours 


3o 


Arab 2mM SA 

n l ci Ly ^.xiiivx OA 
1 " 


A i*a i"iifir\"nci c 

dUlU.U|Jolo 


108587 


A £*TM 0 1 

/■\cridi 


i issue 


i issue 










6 


Timepoint 


Hours 












1 ICdLIIlCIll 


v^uiiipo unu 


3u 


Arab 5mM H2 

02 r 


A ra bi d nn <;i c 


108582 


A f»T*1 0 1 
Idl 


Tl CCl 1£* 


I 1CC11P 

I issue 










1 


Timepoint 


Hours 












i redimeni 


compound 


3u 


Arab 5mM H2 
02 f 


A rabi H n n cj c 


10858^ 


A f=*T*1 tl 1 


Tl CC11P 

1 issue 


i issue 










5mM_H202 


Treatment 


Compound 










w 


I i mpr\ni nt 

i micpuiiii 


Hours 


3v 


Arab 5mM Na 

NP_r 


Arabidon^i 0 , 


108584 


A p t* i a 1 
/\t i idi 


Ti cci i^ 
1 lobUC 


Tl CCl If* 

i looue 










1 


Timenoint 


Honrs 

x iu vii o 










5mM NaNP 


Treatment 


Comnound 

V/W111UU 14X1^1 


3v 


Arab 5mM Na 
NP_f 


Arabidopsis 


108585 


Aerial 


Ticcne 


Ti^c^iip 

X IojUv 










5mM NaNP 


Treatment 


Compound 










6 


Timepoint 


Hours 


3t 


Arab_Cold_l 


Arabidopsis 


108578 


Aerial 


Tissue 


Tissue 










Cold 


Treatment 


Compound 










1 


Timepoint 


Hours 


3t 


Arab_Cold_l 


Arabidopsis 


108579 


Aerial 


Tissue 


Tissue 
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6 


Timepoint 


Hours 










Cold 


Treatment 

X 1 viiillivllt 


Comnound 

\-/V/UlUV/ U11U 




Arab Drought 
1 


Arabidopsis 


108572 


Aerial 


Tissue 

A UJUV 


Ti ssue 










1 


Timepoint 


Hours 










Drought 


Treatment 


Compound 


3g 


Arab Drought 
1 


Arabidopsis 


108573 


Aerial 


Tissue 


Tissue 










Drought 


Treatment 


Compound 










6 


Timepoint 


Hours 


3s 


Arab Heat 1 


Arabidonsis 


108576 


Aerial 


Tissue 


Tissue 










1 


Timenrvint 


Hours 










Heat (42 deg 
O 


Treatment 


Compound 


3s 


Arab Heat 1 


ArahiHrvnsis 

.ill CLLSIU-VSL/OIO 


108577 


A pri a 1 


Ti sci ip 


Ti CC11P 










Heat (42 deg 
w 


Treatment 


Compound 














T-fonro. 

1 1VJ Lil o. 


3aa (ovule) 


ArabLer- 
ni ovule 1 


Arabidopsis 


108595 


Ler_pi 


Plant Line 


Hours 










0\/ii1p 

v_7 V L11C- 


Ti CGI If* 


Ti CC11F* 


3b 


Arab T ,er- 
rhl_rootl 


A raHi H rvnsi s 


1 08594 


T pi* rlil 

JUvt/l ill! 


Plant T itip 


T-Tmirc 

ll\J HI o 










Root 


Ti QC11P 
1 1 00 Lit/ 




31 


Arab N03 H- 
to-l/l 


Arabidonsis 


108592 


Aerial 


Ti ssnp 


Ti s^hp 

A lOO LAV-/ 










Low Nitrogen 


Treatment 


Compound 










12 


Ti m pr\ni nt 


1 IUUIIj 


31 


Arab N03 H- 

x 11 W -1 1 V^X ^/ A A 

to-LJ 


Arabidonsis 


108593 


A pri a 1 

nvi icii 


Tissiip 


T'l QCUP 

A IdOUW 










24 


Timepoint 


Hours 










T ,ow Nitrogen 


Trpatrn pnt 


(^omriniind 

\J 1 1 1 VJ \J LAI 1 \Jk 


31 


Arab N03 L- 
to-Hl 


Arabidonsi s 


108588 


Aerial 


Tissue 


Ti SSI IP 










2 


Timepoint 


Hours 










Nitrogen 


Treatment 

A A vUllllVlll 


Comnound 

lAAAVl 


31 


Arab N03 L- 
to-H_l 


Arabidopsis 


108589 


Aerial 


Tissue 


Tissue 










Nitrogen 


Treatment 


Compound 










6 


Timepoint 


Hours 


31 


Arab N03 L- 
to-H_l 


Arabidopsis 


108590 


Aerial 


Tissue 


Tissue 










9 


Timepoint 


Hours 










Nitrogen 


Treatment 


Compound 
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31 


Arab N03 L- 
to-H_l 


Arabidopsis 


108591 


Aerial 


Tissue 


Ti ssi ip 

X IOjUv 










Nitrogen 


Treatment 


Compound 










12 


Timenoint 


Hours 

X 1V_/ UX O 


3p 


Arab Woundin 
g_l 


Arabidopsis 


108574 


Aerial 


Tissup 


Ti qsup 

X XojUv 










i 


Timepoint 


Hours 










Woundin*? 


Treatment 

X 1 ^✓GLLAllvllL 


PntnnniinH 

V_^V_J1 1 1L7\_J LiliLi. 


3p 


Arab Woundin 
g 1 


Arabidopsis 


108575 


Aerial 


TlSSUP 


Ti QQ11P 










Wounding 


Treatment 


Compound 










6 


Timpnnint 

X llllv VjKJVYW 


1 1U Lil o 


3o 


Columbia/CS37 
26 flower SA 


Arabidopsis 


108475 


Columbia 

V/ \J 1 Ul 1 1 \J 1 tf- 


snppips 


1 1U LU o 










SA 


Treatment 


Compound 










5 wppIcs 

VV vvlv O 


Timpnnint 

1 llllCjJwlill 


T-Trnifc 
1 1VJ Ul o 


3o 


Columbia/CS37 
26 flower SA 


Arabidoosis 


108476 


CS3726 


oiJtviGo 


T-Tr\ni*o 

xTLJUx o 










5 wppk<s 


Ti m pnni nt 

X llllv LFVJllll 


T-Tr*lirC 

Lx\J HI o 










SA 


Treatment 


Compound 


3d 


Corn 0 001 Per 
cent MeJA 


£~j\sCL ivxa. y o 


1 UOJJJ 


/\criai 


i issue 


i issue 










24 


Timepoint 


Hours 










0 001% MpT 

v/.WVJl /O 1V1CJ 

A 


i rcaUTicni 


k^, o mpo uno 


3i 


Corn 0.1 uM B 
rassino Steroid 


7,ea IVfavs 

M- — J V-/ lYlUj O 


108557 


24 


x liiiepoini 


Hours 










Aerial 


Ti ip 

X lOO Lit 


1 l^oUC 










O.luMBrassi 
no Steroid 


Treatment 


Compound 


3g 


Corn lOOuM 
ABA 


Zea Mays 


108513 


Aerial 


Tiqciip 

X lOOUt 


Tl CC1 lf^ 
1 loo LlC 










ABA 


Treatment 


Compound 










6 


Timpnnint 

X XlXlt'LUJllll 


T-Tr\i ire 


3g 


Corn lOOuM 
ABA 


Zea Mays 


108597 


Aerial 


Tissue 


Tiq<iiip 

X loo Llv 










24 


Timepoint 


Hours 










lOOuM ABA 


Treatment 

x ivliUllvi.ll 


Pnmnni ltlH 

11 LJw 


3i 


Corn lOOuM 
BA 


Zea Mays 


108517 


Aerial 


Tissue 


Tissue 










6 


Timepoint 


Hours 










BA 


Treatment 


Compound 


3k 


Corn lOOuM 
GA3 


Zea Mays 


108519 


Aerial 


Tissue 


Tissue 










lOOuM 


Treatment 


Compound 
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Giberillic 
Acid 














1 


I 1111CL/U111L 


T-TfM ire 


3k 


Corn lOOuM 
GA3 


Zea Mays 


108520 


Aerial 


Tissue 


Tissue 










6 


Timepoint 


Hours 










lOOuM 

vJlUCrilllL 

A piH 


Treatment 


Compound 


3k 


Corn_100uM 
GA3 


Zea Mays 


108521 


Aerial 


Tissue 


Tissue 










lOOuM 

OiHerillif* 

Acid 


1 ICcUIIlClll 


i /"vfyi t>ai inn 

v_/Ompounu 










12 


Timp'noirit 

1 lllieLrVJllll 


1-1 rmt*c 
inutll a 


3h 


Corn lOOuM 
NAA~ 


Zea K'favs 


108516 


A pt*i a 1 


T"i cci if* 

1 lob LiC 


Ti COUP 










NAA 


Treatment 


Compound 












1 HllCjJOllH 


Hours 


3h 


CornJOOuM 
NAA 


Zea Mays 


108554 


Aerial 


Tissue 


Tissue 










24 


Timpr\nmt 
1 1111CUU111L 


nuuia 










NAA 


Trptitinpnt 
1 iCdUIlClll 


i^umpouiiu 


3hh 


Corn_1400- 
6/S-17 


Zea Mays 


108598 


Shoot apices 


Tissue 


Tissue 


3r 


Corn 150mM 
NaCl 


Zea Mavs 


108541 


Aerial 




It CCI If* 

1 loo LIC 










1 


Timepoint 


Hours 










ISOmM NaCl 


1 IC&llllClll 


ump U Ul 1U 


3r 


Corn 150mM 
NaCl 


Zea Mays 


108542 


Aerial 


Tiooiip 


1 loo UG 










150mM_NaCl 


Treatment 


Compound 










6 


Timenoint 


X ±\J Ul O 


3r 


Corn 150mM 
NaCl 


Zea Mays 


108553 


Aerial 


Ti^ne 

1 lOO LAW 


Ti ^^iip 










24 


Timepoint 


Hours 










150mM NaCl 


Treatment 

X l WClLlAXkvAAL 


\^UX11LF\JL*11W 


3r 


Corn 20% PE 
G 


Zea Mays 


108539 


Aerial 


Tissue 


Ti <?<;iie 
x looixe 










i 


Timepoint 


Hours 










20%PEG 


Treatment 


Compound 


3r 


Corn 20% PE 
G 


Zea Mays 


108540 


Aerial 


Tissue 


Tissue 










20%PEG 


Treatment 


Compound 










6 


Timepoint 


Hours 



673 



PATENT 

Attorney Docket No. 2750-1481P 



3o 


Corn_2mM_SA 


Zea Mays 


108515 


Aerial 


Tissue 


Tissue 










SA 


Treatment 


Compound 










12 


Timepoint 


Hours 


3o 


Corn_2mM_SA 


Zea Mays 


108552 


Aerial 


Tissue 


Tissue 










SA 


Treatment 


Compound 










24 


Timepoint 


Hours 


3u 


Corn 5mM H2 
02 


Zea Mays 


108537 


Aerial 


Tissue 


Tissue 










H202 


Treatment 


Compound 










1 


Timenoint 

X llllvUUlill 


Hours 


3u 


Corn 5mM H2 
02 


Zea Mays 


108538 

1 KJ KJ 


Aerial 


Ti ssue 


Ti ssue 










6 


Timepoint 


Hours 










H202 


Treatment 


Comnoiind 


3u 


Corn 5mM H2 

V/l XX w' 1111 ▼ X. 1 1. 

02 


Zea Mavs 


108558 
i \j %j \j 


Aerial 


Ti^np 

J. IDjUL 


Ti<l«sllP 
X iOjUL' 










24 


Timepoint 


Hours 










H202 

X X^> V/Z. 


Trp c* t TYi p n t 




3v 


Corn 5mM N 

o 


Zea Mavs 


108526 


Aerial 


Ti^llP 

X lOOUL' 


Ti<i<5iip 










NO 


Treatment 


Compound 










1 


Ti m p"noi nt 

X 1111 W LFL/llll 


X i\J 141 0 


3v 


Corn 5mM N 

o 


Zea IVTavs 


108527 


Atrial 


Ti ^qiip 


Ti ^^iip 










6 


Timenoint 


Hours 










NO 


Treatment 

1 1 VUtillVllL 


Comnound 


3v 


Corn 5mM N 
O 


Zea Mays 


108559 


Aerial 


Tissue 


Tissue 










12 


Timepoint 


Hours 










NO 


Treatment 


Compound 


3t 


CornCold 


Zea Mays 


108533 


Aerial 


Tissue 


Tissue 










1 


Timepoint 


Hours 










Cold 


Treatment 


Compound 


3t 


Corn_Cold 


Zea Mays 


108534 


Aerial 


Tissue 


Tissue 










Cold 


Treatment 


Compound 










6 


Timepoint 


Hours 


3q 


Corn_Drought 


Zea Mays 


108502 


Drought 


Treatment 


Compound 










1 


Timepoint 


Hours 


3q 


Corn_Drought 


Zea Mays 


108503 


Drought 


Treatment 


Compound 










6 


Timepoint 


Hours 


3q 


Corn_Drought 


Zea Mays 


108504 


Drought 


Treatment 


Compound 










12 


Timepoint 


Hours 


3q 


CornJDrought 


Zea Mays 


108556 


Drought 


Treatment 


Compound 










24 


Timepoint 


Hours 
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3s 


CornHeat 


Zea Mays 


108522 


Aerial 


Tissue 


Tissue 










1 


Timepoint 


Hours 










Heat (42 deg 
C) 


Treatment 


Compound 


3s 


Corn Heat 


Zea Mays 


108523 


Aerial 


Tissue 


Tissue 










6 


Timepoint 


Hours 










Heat (42 deg 
C) 


Treatment 


Compound 


3ee 

oo 


Corn Imbibed 
Seeds 


Zea Mays 


108518 


Imbibed 


Treatment 


Compound 










4 


Age 


days old 










Roots 


Tissue 


Tissue 


3ee 

00 


Corn Imbibed 
Seeds 


Zea Mays 


108528 


Imbibed 


Treatment 


Compound 










Aerial 


Tissue 


Tissue 










5 


A&e 


davs old 


322 

^©o 


Corn Imbibed 
Seeds 


Zea Mays 


108529 


Imbibed 


Treatment 


Compound 










5 


Age 


days old 










Root 


Tissue 


Tissue 


322 


Corn Imbibed 
Seeds 


Zea Mays 


108530 

A V7 \-F \J 


Imbibed 


Treatment 

X 1 ViitlllVliL 


Comnound 










Aerial 


Tissue 


Tissue 










6 


Acre 


davs old 


322 


Corn Imbibed 

Will XJllllyl L/ 

Seeds 


Zea Mavs 


108531 


Imbibed 


Treatment 

A 1 viivillvlll 


Comnound 










6 


Age 


days old 










root 


Tissue 


Tissue 


322 


Corn Imbibed 
Seeds 


Zea Mays 


108545 


Imbibed 


Treatment 


Compound 










Aerial 


Tissue 


Tissue 










3 


Ase 


days old 


3 22 
oo 


Corn Imbibed 
Seeds 


Zea Mays 


108546 


Imbibed 


Treatment 


Compound 










3 


Age 


days old 










Root 


Tissue 


Tissue 


322 

-'oo 


Corn Imbibed 
Seeds 


Zea Mays 


108547 


Imbibed 


Treatment 

1 1 vUllllvilV 


Comnound 










Aerial 


Tissue 


Tissue 










4 


Age 


days old 


3gg 


Corn_Imbibed_ 
EmbryoEndos 
perm 


Zea Mays 


108543 


2 


Age 


days old 










Imbibed 


Treatment 


Compound 










Embryo 


Tissue 


Tissue 
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3gg 


Corn Imbibed 

Embryo_Endos 

perm 


Zea Mays 


108544 


2 


Aee 


days old 










Endosperm 


Tissue 


Tissue 










Imbibed 


Treatment 


Compound 


3ee 


Corn_Meristem 


Zea Mays 


108535 


Root 
Meristem 


Tissue 


Tissue 










192 


Timepoint 


Hours 


3ee 


ComJVleristem 


Zea Mays 


108536 


Shoot 
Meristem 


Tissue 


Tissue 










192 


Timepoint 


Hours 


3n 


Corn Nitrogen 
_H_to_L 


Zea Mays 


108532 


Roots 


Tissue 


Tissue 










Low Nitrogen 


Treatment 


Compound 










16 


Timepoint 


Hours 


3n 


Corn Nitrogen 

„H_to_L 


Zea Mays 


108548 


Root 


Tissue 


Tissue 










Low Nitrogen 


Treatment 


Compound 










4 


Timenoint 

■A- 1111V L/VliiV 


Hours 


3m 


Corn Nitrogen 
_L_to_H 


Zea Mays 


108549 


Aerial 


Tissue 


Tissue 










0.166 


Timepoint 


Hours 










Nitrogen 


Treatment 


Comnound 


3m 


Corn Nitrogen 
_L_to_H 


Zea Mays 


108550 


Aerial 


Tissue 


Ti ssue 










Nitrogen 


Treatment 


Compound 










1.5 


Timepoint 


Hours 


3m 


Corn Nitrogen 
_L_to_H 


Zea Mays 


108551 


Aerial 


Tissue 


Tissue 










3 


Timepoint 


Hours 










Nitrogen 


Treatment 


Compound 


3ff 


Corn_RTl 


Zea Mays 


108599 


Unknown 


Plant Line 


Hours 










Root 


Tissue 


Tissue 


3p 


CornJWoundin 
g 


Zea Mays 


108524 


Aerial 


Tissue 


Tissue 










Wounding 


Treatment 


Compound 










1 


Timepoint 


Hours 


3p 


CornWoundin 

g 


Zea Mays 


108525 


Aerial 


Tissue 


Tissue 










6 


Timepoint 


Hours 










Wounding 


Treatment 


Compound 


3g 


DroughtFlowe 
rs 


Arabidopsis 


108473 


Flowers 


Tissue 


Tissue 










7d 


Timepoint 


Hours 










Drought 


Treatment 


Compound 



676 



PATENT 

Attorney Docket No. 2750-1481P 



3g 


Drought Flowe 
rs 


Arabidopsis 


108474 


Flowers 


Tissue 


Tissue 










Drought 


Treatment 


Compound 










8d(ld- 
post_re- 
watering) 


Timepoint 


Hours 


3k 


GA Treated 


Arabidopsis 


108484 


1 


Timepoint 


Hours 










1 


Timepoint 


Hours 


3k 


GA Treated 


Arabidopsis 


108485 


6 


Timepoint 


Hours 










6 


Timepoint 


Hours 


3k 


GA Treated 


Arabidopsis 


108486 


12 


Timepoint 


Hours 










12 


Timeooint 


Hours 


3e 


Germinating 
Seeds 


Arabidopsis 


108461 


Day 1 


Timepoint 


Hours 


3e 


Germinating 
Seeds 


Arabidoosis 


108462 


Day 2 


Timenoint 


Hours 


3e 


Germinating 
Seeds 


Arabidopsis 


108463 


Day 3 


Timepoint 


Hours 


3e 


Germinating 
Seeds 


Arabidopsis 


108464 


Day 4 


Timepoint 


Hours 


3bb 


Herbicide V3.1 


Arabidopsis 


108465 


Round up 


Treatment 


Compound 










12 


Timepoint 


Hours 


3bb 


Herbicide V3.1 


Arabidopsis 


108466 


Trimec 


Treatment 


Compound 










12 


Timepoint 


Hours 


3bb 


Herbicide V3.1 


Arabidopsis 


108467 


Finale 


Treatment 


Compound 










12 


Timepoint 


Hours 


3bb 


Herbicide V3.1 


Arabidopsis 


108468 


Glean 


Treatment 


Compound 










12 


Timepoint 


Hours 


3bb 


Herbicide_v2 


Arabidopsis 


107871 


Finale 


Treatment 


Compound 










4 


Timepoint 


Hours 


3bb 


Herbicide_v2 


Arabidopsis 


107876 


Finale 


Treatment 


Compound 










12 


Timepoint 


Hours 


3bb 


Herbicide_v2 


Arabidopsis 


107881 


Glean 


Treatment 


Compound 










4 


Timepoint 


Hours 


3bb 


Herbicide v2 


Arabidopsis 


107886 


Trimec 


Treatment 


Compound 










4 


Timepoint 


Hours 


3bb 


Herbicide_v2 


Arabidopsis 


107891 


Trimec 


Treatment 


Compound 










12 


Timepoint 


Hours 


3bb 


Herbicide_v2 


Arabidopsis 


107896 


Round-up 


Treatment 


Compound 










4 


Timepoint 


Hours 


3d 


Trichome 

Inflorescences 

expt 


Arabidopsis 


108452 


Hairy 

Influorescenc 
e#l 
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6. MA Clusters Table 

Microarray data was clustered using one of two methods: "complete linkage" or "nearest 
neighbor" analysis. These clustering methods are described in more detail elsewhere herein. The 
results of the clustering analysis are presented in the MA clust table. The table is organized as 
follows: 

"METHOD" refers to a method number which clustering method used. 
"CL_METHOD_TYPE==TRUE" refers to complete linkage method. 
"NN_METHOD_TYPE=TRUE" refers to the nearest neighbor method. 
"FULL_NN_METHOD_TYPE=TRUE" refers to the nearest neighbor method, where no size 
limitation was placed on the cluster. 

"PARAMETERS" refers to the parameters utilized for the analysis. The nature of these is 
also described in more detail elsewhere herein. 

"ORGANISM" refers to the cDNA spotted on the chip were similar to Arabidopsis 
thaliana (3769) sequences or whether the oligo used for the chips were similar to Zea mays 
(3 1 1 987)sequences. 

Each cluster or group of cDNA is identified by a "Group #", following which are the 
individual cDNA Ids that are a member of that Group 

7. Knock-in Table 

The Knock-In Table presents the results of knock-in experiments wherein plants are 
grown from tissues transformed with a marker gene-containing insert and phenotypes are 
ascertained from the transformed plants. Each section of the Table relating to information on a 
new transformant begins with a heading "Knock-in phenotype in gene (cDNAJd):" followed by 
a number which represents the Ceres internal code for a proprietary cDNA sequence. The 
described transformant was prepared by procedures described herein, wherein the identified 
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Ceres proprietary cDNA_id (corresponding to the cDNA_id in the Reference and Sequence 
Tables) was interrupted by the marker gene-containing insert.The following information is 
presented for each section. 



• Parent plants used in cross - presents the id numbers of the parent plants which 
were crossed to produce the Fl generation plant for which a phenotype is 
described. The parent plant with the promoter is described by a plant line 
descriptor. 

• Clone ID - presents the clone number of the Ceres proprietary clone which was 
the source of the cDNAid. 

• Phenotype ID - represents an internal identification code. 

• Unique FI plant ID - represents the internal code for the Fl plant for which a 
phenotype is described. 

• Assay - presents the type of growth analyzed (e.g. soil gross morphology), 
followed by the assay name which corresponds to the type/location of the tissue 
that was obsereved., the name of the assay conducted for which the result 
provided the identified phenotype. 

• Phenotype - describes the phenotype noted for the Fl generation transformant. 

• Notes - may provide additional information on the described phenotype for the 
transformant. 



Each knock-in representing a transformant with an interruption in the identified cDNAid 
may be correlated with more than one identified phenotype. 

8. Knock-out Table 

The Knock-Out Table presents the results of knock-out experiments wherein plants are 
grown from tissues transformed with a marker gene-containing insert wherein phenotypes are 
ascertained from the transformed plants. Each section of the Table relating to information on a 
new transformant begins with a heading "tail id:" representing an internal code. The following 
information is presented for each section. 

br - provides another internal code for the experiment. 
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Phenotype_ id - provides an identification number for the particular phenotype identified 
for the transformant. 

assay - identifies the assay procedure utilized in the experiment to identify a phenotype 
for the transformant. 

phenotype - represents an internatl identification code. 

ratio - represents a segregation ratio. 

notes - lists any notes relevant to the identified phenotype. 

Knock-out in-genes - Identifies the genes in which the tag has inserted 

6) the less than 501 upstream of the transcriptional start site; 

7) less than 701 upstream of the translational initiation codon; 

8) between the translational initiation and termination codons of the gene, 

9) less than 301 downstream of the translational stop codon; or 

1 0) less than 1 5 1 downstream of a transcriptional termination site or a gene. 
In this table the gene is identified by its cDNA ID number, the Ceres SEQ ID that is 
indicated in the (Ac) portion of the Reference tables. For each cDNAJd, the following 
information is provided: 

the cDNA_id number. 

in parenthesis, the cluster number of which the identified cDNA is a 
member. 

the "gDNAInsert pos" representing the position of the insert in the 
corresponding gDNA sequence 

the gi nimber refers to the TIGR chromosome sequences for Arabidopsis. 

Knock-out out ofgenes: Identifies the Ceres cDNA proprietary sequences (noted by 
cDNA_id which are the same as those identified in the Reference and Sequence Tables) 
which are closest in position to the insert, both upstream and downstream from the insert. 
For each cDNA_id, the following information is provided: 

In the first parentheses, R indicates that the gene is to the right of the tag, 
L indicates that the gene is to right of the tag as the sequences is read left 
to right 

the cDNA id number 
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in next parentheses, the cluster number of which the identified cDNA is a 
member. 

the distance (in number of nucleotides) of the insert is upstream of the 
start of the gene annotation as described in the Reference Tables or 
downstream at the end the gene annotation. 

the "gDNAInsert pos" representing the position of the insert in the 
corresponding gDNA sequence 

the gi nimber refers to the TIGR chromosome sequences for Arabidopsis. 

9. Protein Domain Table 

The Protein Domain table provides details concerning the protein domains noted in the 
Reference Table. The majority of the protein domain descriptions given in the Protein Domain 
Table are obtained from Prosite, (http//www.expasy.ch/prosite/), and Pfam, 
(http//pfam.wustl.edi^rowse.shtml). Each description in The Table begins with the pfam and 
Prosite identifying numbers, the full name of the domain, and a detailed description, including 
biological and in vivo implications/functions for the domain, references which further describe 
such implications/functions, and references that describe tests/assays to measure the 
implications/functions. 

10. Single Gene Functions & Utilities Table 

The Single Gene Functions & Utilities Table describes particular utilities/functions of 
interest for individual genes. The Table identifies the cDNA JD of interest, correlates to that 
cDNA the relevant phenotype, protein domain and microarray/differential expression data. The 
final column of the Table identifies the utilities/functions of particular interest for the identified 
cDNA. 

1 1 . Cluster Functions & Utilities Table 

The Cluster Functions & Utilities Table describes particular utilities/functions of interest 
for identified clusters of genes. The Table provides the following information: 
Record # - an internal identifier. 
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Goup - identifies the group of clusters of interest, wherein each group is identified with 
the same utilities/funcions as set forth in the right-hand most column. 

CDNA - identifies the cDNA of interest with the noted utility/function. 
CDNA_Cluster - identifies the cDNA Cluster ID of interest. 
Gi No - refers to the public genomic sequence that matches to the cDNA 
NR Hit - refers to the most relevant protein domain for the cDNA of interest. 
Pfam and Pfam Desc - provide the protein domain name. 

Notes/Annotations - provides some notes relevant to the data/information analysis. 
Utilities/Functions - this rightmost column identifies utilities/functions of particular 
interest for the group of cDNAs and clusters. 

12. cDNA Clusters Table 

The cDNA Clusters Table correlates the Ceres cDNA ID nos. (in numerical order) with 
the relevant cDNA cluster which contains each cDNAID. 

13. Stanford old new cDNA map Table 

During the course of the experiments reported herein, some of the cDNA sequences were 
assigned new Ceres internal cDNA id numbers. The cDNA map Table provides a list of the 
original "old" cDNA ids and correlates those id numbers with any new cDNA id which may 
have been assigned. Thus, any "old" and "new" cDNA ids which are on the same line in the 
Table are, in fact, the same sequence. 

14. gb Only Peptides Table 

In the Protein Group table, a number of proteins encoded by Genbank predictions are 
included. These proteins were referenced with a peptide ID number. The peptide ID number is 
linked to the amino acid sequence of the Genbank prediction in this table. 

15. Stanford Old New cDNA Table 

During the course of the experiments reported herein, some of the cDNA sequences 
utilized in the Stanford Microarray differential expression analysis experiments were assigned 
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new Ceres internal cDNAid numbers. The Stanford_old_new_cDNA Table provides a list of 
the original "old" cDNA_ids and correlates those id numbers with any new cDNA_id which may 
have been assigned. Thus, any "old" and "new" cDNA ids which are on the same line in the 
Table are, in fact, the same sequence. 

16. Enhanced Amino Table 

This table list the peptide IDs of polypeptides with enhanced amino acid content. The table 
list the peptide ID following with the single letter code of the amino acid that is enhanced. The 
table also includes a frequency that the amino acid occurred. The frequency was calculated by 
dividing the total number of the desired amino acid indicated in the column by the number of 
residues in the peptide. For example, if amino acid A, occurred 50 times in a polypeptide that is 
100 amino acid long, the frequency would be 50 divided by 100 or 0.5. 

17. Stanford old new cDNA map Table ** Keep only for patent #3 
During the course of the experiments reported herein, some of the cDNA sequences were 
assigned new Ceres internal cDNA_id numbers. The docket_80090_101_cDNA_map provides a 
list of the original "old" cDNA ids in the Reference and Sequence tables and correlates those id 
numbers with any new cDNAid which may have been assigned and utilized in the remaining 
tables. Thus, any "old" and "new" cDNA ids which are on the same line in the Table are, in fact, 
the same sequence. 

17. The AFLP_int Table 

The table includes the following headings: 

A. Band id 

B. Mobility 

C. Sl-Sll 

A. Band id 

This text is the band identified in the experiment. 

C. B. Mobility 

D. The mobility of the nucleic acid bands in relation to markers in the gel. 
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C. Sl-Sll 

These are the various tissue samples from Arabidopsis thaliana (wassilewskija) 
listed below: 



5 SI Dark adapted seedlings 

52 Roots/etiolated seedlings 

53 Mature leafs, soil grown 

54 Immature buds, inflorescence meristem 

55 Flowers opened 

1 0 S6 Siliques, all stages 

57 Senescing leaves (just beginning to yellow) 

58 A pooled sample of: 

W Callus Inducing medium 

iji | Callus shoot induction 

•ft Callus root induction 

© S9 Wounding 

Methy 1-j asmonate-treated 

S 1 0 Oxidative stress 

; M 

!«& Drought stress 

20 Oxygen Stress-flooding 

SI 1 Heat treated light grown seedling 
Cold treated light grown seedlings 



The numbers below each sample correspond to the levels of expression of a 
25 particular gene, polynucleotide, or nucleic acid. The stronger the intensity of the band on 

a AFLP gel, the higher the levels of expression. 

19. The AFLP diffTabble 

This table shows which band_id corresponds to which specific cDNA Ceres SEQ ID 
30 NOs. Differential expression of the band is shown by a "+" or 
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HOW TO MAKE DIFFERENT EMBODIMENTS OF THE INVENTION 

The invention relates to (I) polynucleotides and methods of use thereof, such as 

IA. Probes, Primers and Substrates; 

IB. Methods of Detection and Isolation; 
B.l. Hybridization; 

B.2. Methods of Mapping; 

B.3. Southern Blotting; 

B.4. Isolating cDNA from Related Organisms; 

B. 5. Isolating and/or Identifying Orthologous Genes 

IC. Methods of Inhibiting Gene Expression 

C. l. Antisense 

C.2. Ribozyme Constructs; 

C . 3 . Chimeraplasts ; 

C.4 Co-Suppression; 

C.5. Transcriptional Silencing 

C.6. Other Methods to Inhibit Gene Expression 

ID. Methods of Functional Analysis; 

IE. Promoter Sequences and Their Use; 

IF. UTRs and/or Intron Sequences and Their Use; and 

IG. Coding Sequences and Their Use. 

The invention also relates to (II) polypeptides and proteins and methods of use thereof, such 

is 

IIA. Native Polypeptides and Proteins 

A.l Antibodies 

A. 2 In Vitro Applications 

IIB. Polypeptide Variants, Fragments and Fusions 

B. l Variants 
B.2 Fragments 
B.3 Fusions 
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The invention also includes (III) methods of modulating polypeptide production, such as 

IIIA. Suppression 

A.l Antisense 
A.2 Ribozymes 
5 A3 Co-suppression 

A. 4 Insertion of Sequences into the Gene to be Modulated 
A.5 Promoter Modulation 

A.6 Expression of Genes containing Dominant-Negative Mutations 

IIIB. Enhanced Expression 

10 B.l Insertion of an Exogenous Gene 

a 

|S B.2 Promoter Modulation 

w 

111 The invention further concerns (IV) gene constructs and vector construction, such as 

^ l IVA. Coding Sequences 

$5 IVB. Promoters 

O IVC. Signal Peptides 

m 
m 

Hp The invention still further relates to 

m 

y* V. Transformation Techniques 

20 

L Polynucleotides 

Exemplified SDFs of the invention represent fragments of the genome of corn, wheat, rice, 
soybean or Arabidopsis and/or represent mRNA expressed from that genome. The isolated nucleic 
acid of the invention also encompasses corresponding fragments of the genome and/or cDNA 

25 complement of other organisms as described in detail below. 

Polynucleotides of the invention can be isolated from polynucleotide libraries using 
primers comprising sequence similar to those described by or Reference, Sequence or 
polynucleotides that encode sequences Protein Group, and Protein Group Matrix tables or 
complements thereof. See, for example, the methods described in Sambrook et ah, supra. 

30 Alternatively, the polynucleotides of the invention can be produced by chemical 

synthesis. Such synthesis methods are described below. 
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It is contemplated that the nucleotide sequences presented herein may contain some small 
percentage of errors. These errors may arise in the normal course of determination of nucleotide 
sequences. Sequence errors can be corrected by obtaining seeds deposited under the accession 
numbers cited above, propagating them, isolating genomic DNA or appropriate mRNA from the 
5 resulting plants or seeds thereof, amplifying the relevant fragment of the genomic DNA or 

mRNA using primers having a sequence that flanks the erroneous sequence, and sequencing the 
amplification product. 



LA. Probes, Primers and Substrates 
10 SDFs of the invention can be applied to substrates for use in array applications such as, 

h q but not limited to, assays of global gene expression, for example under varying conditions of 
^ development, growth conditions. The arrays can also be used in diagnostic or forensic methods 
Ul (WO95/35505, US 5,445,943 and US 5,410,270). 

m 

|lj Probes and primers of the instant invention will hybridize to a polynucleotide comprising 

J 5 a sequence in or encoded by those in the Reference, Sequence, Protein Group, and Protein Group 
Matrix tables or fragments or complement thereof. Though many different nucleotide sequences 
ry can encode an amino acid sequence, the sequences of the reference and Sequence table or 

sequences that encode polypeptides or fragments thereof described in Protein Group and Protein 
y* Group Matrix tables are generally preferred for encoding polypeptides of the invention. 
20 However, the sequence of the probes and/or primers of the instant invention need not be identical 
to those in the Reference and Sequence tables or the complements thereof. For example, some 
variation in probe or primer sequence and/or length can allow additional family members to be 
detected, as well as orthologous genes and more taxonomically distant related sequences. 
Similarly, probes and/or primers of the invention can include additional nucleotides that serve as 
25 a label for detecting the formed duplex or for subsequent cloning purposes. 

Probe length will vary depending on the application. For use as primers, probes are 12- 
40 nucleotides, preferably 18-30 nucleotides long. For use in mapping, probes are preferably 50 
to 500 nucleotides, preferably 100-250 nucleotides long. For Southern hybridizations, probes as 
long as several kilobases can be used as explained below. 
30 The probes and/or primers can be produced by synthetic procedures such as the triester 

method of Matteucci et al. J. Am. Chem. Soc. 103:3185( 1981); or according to Urdea et al. Proc. 
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Natl Acad. 80:7461 (1981) or using commercially available automated oligonucleotide 
synthesizers. 



LB. Methods of Detection and Isolation 
5 The polynucleotides of the invention can be utilized in a number of methods known to 

those skilled in the art as probes and/or primers to isolate and detect polynucleotides, including, 
without limitation: Southerns, Northerns, Branched DNA hybridization assays, polymerase chain 
reaction, and microarray assays, and variations thereof. Specific methods given by way of 
examples, and discussed below include: 
10 Hybridization 

:;=5S=; 

^ Methods of Mapping 

:^ Southern Blotting 

W 

111 Isolating cDNA from Related Organisms 

01 

::=y Isolating and/or Identifying Orthologous Genes. 

l fb Also, the nucleic acid molecules of the invention can used in other methods, such as high density 

ip oligonucleotide hybridizing assays, described, for example, in U.S. Pat. Nos. 6,004,753; 

}y 5,945,306; 5,945,287; 5,945,308; 5,919,686; 5,919,661; 5,919,627; 5,874,248; 5,871,973; 

; g 5,871,971; and 5,871,930; and PCT Pub. Nos. WO 9946380; WO 9933981; WO 9933870; WO 

C 9931252; WO 9915658; WO 9906572; WO 9858052; WO 9958672; and WO 9810858. 

20 

B . 1 . Hybridization 

The isolated SDFs of the Reference and Sequence tables or SDFs encoding polypeptides 
of the Protein Group and Protein Group Matrix tables or fragments thereof of the present 
invention can be used as probes and/or primers for detection and/or isolation of related 

25 polynucleotide sequences through hybridization. Hybridization of one nucleic acid to another 
constitutes a physical property that defines the subject SDF of the invention and the identified 
related sequences. Also, such hybridization imposes structural limitations on the pair. A good 
general discussion of the factors for determining hybridization conditions is provided by Sambrook 
et al. ("Molecular Cloning, a Laboratory Manual, 2nd ed., c. 1989 by Cold Spring Harbor 

30 Laboratory Press, Cold Spring Harbor, NY; see esp., chapters 1 1 and 12). Additional 



694 



PATENT 

Attorney Docket No. 2750-1481P 
considerations and details of the physical chemistry of hybridization are provided by G.H. Keller 
and M.M. Manak "DNA Probes", 2 nd Ed. pp. 1-25, c. 1993 by Stockton Press, New York, NY. 

Depending on the stringency of the conditions under which these probes and/or primers are 
used, polynucleotides exhibiting a wide range of similarity to those in the Reference and Sequence 
or encoding polypeptides of the Protein Group and Protein Group Matrix tables or fragments thereof 
can be detected or isolated. When the practitioner wishes to examine the result of membrane 
hybridizations under a variety of stringencies, an efficient way to do so is to perform the 
hybridization under a low stringency condition, then to wash the hybridization membrane under 
increasingly stringent conditions. 

When using SDFs to identify orthologous genes in other species, the practitioner will 
preferably adjust the amount of target DNA of each species so that, as nearly as is practical, the 
same number of genome equivalents are present for each species examined. This prevents faint 
signals from species having large genomes, and thus small numbers of genome equivalents per 
mass of DNA, from erroneously being interpreted as absence of the corresponding gene in the 
genome. 

The probes and/or primers of the instant invention can also be used to detect or isolate 
nucleotides that are "identical" to the probes or primers. Two nucleic acid sequences or 
polypeptides are said to be "identical" if the sequence of nucleotides or amino acid residues, 
respectively, in the two sequences is the same when aligned for maximum correspondence as 
described below. 

Isolated polynucleotides within the scope of the invention also include allelic variants of the 
specific sequences presented in the Reference, Sequence, Protein Group, and Protein Group Matrix 
tables. The probes and/or primers of the invention can also be used to detect and/or isolate 
polynucleotides exhibiting at least 80% sequence identity with the sequences of the reference, 
Sequence or encoding polypeptides of the Protein Group and Protein Group Matrix tables or 
fragments thereof. 

With respect to nucleotide sequences, degeneracy of the genetic code provides the 
possibility to substitute at least one base of the base sequence of a gene with a different base 
without causing the amino acid sequence of the polypeptide produced from the gene to be 
changed. Hence, the DNA of the present invention may also have any base sequence that has 
been changed from a sequence in the Reference, Sequence, Protein Group, and Protein Group 
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Matrix tables by substitution in accordance with degeneracy of genetic code. References 
describing codon usage include: Carels et al, J. Mol Evol 46: 45 (1998) and Fennoy et al, 
Nucl Acids Res. 21(23) : 5294 (1993). 



5 B.2. Mapping 

The isolated SDF DNA of the invention can be used to create various types of genetic 

and physical maps of the genome of corn, Arabidopsis, soybean, rice, wheat, or other plants. 

Some SDFs may be absolutely associated with particular phenotypic traits, allowing construction 

of gross genetic maps. While not all SDFs will immediately be associated with a phenotype, all 
10 SDFs can be used as probes for identifying polymorphisms associated with phenotypes of 

interest. Briefly, one method of mapping involves total DNA isolation from individuals. It is 
^ subsequently cleaved with one or more restriction enzymes, separated according to mass, 

j iJl transferred to a solid support, hybridized with SDF DNA and the pattern of fragments compared. 

01 

|y Polymorphisms associated with a particular SDF are visualized as differences in the size of 
W fragments produced between individual DNA samples after digestion with a particular restriction 
O enzyme and hybridization with the SDF. After identification of polymorphic SDF sequences, 
ji^jj linkage studies can be conducted. By using the individuals showing polymorphisms as parents in 

crossing programs, F2 progeny recombinants or recombinant inbreds, for example, are then 
y* analyzed. The order of DNA polymorphisms along the chromosomes can be determined based 
20 on the frequency with which they are inherited together versus independently. The closer two 
polymorphisms are together in a chromosome the higher the probability that they are inherited 
together. Integration of the relative positions of all the polymorphisms and associated marker 
SDFs can produce a genetic map of the species, where the distances between markers reflect the 
recombination frequencies in that chromosome segment. 
25 The use of recombinant inbred lines for such genetic mapping is described for 

Arabidopsis by Alonso-Blanco et al. (Methods in Molecular Biology, vol.82, "Arabidopsis 
Protocols", pp. 137-146, J.M. Martinez-Zapater and J. Salinas, eds., c. 1998 by Humana Press, 
Totowa, NJ) and for corn by Burr ("Mapping Genes with Recombinant Inbreds", pp. 249-254. 
In Freeling, M. and V. Walbot (Ed.), The Maize Handbook, c. 1994 by Springer- Verlag New 
30 York, Inc.: New York, NY, USA; Berlin Germany; Burr et al. Genetics (1998) 1 18: 519; 
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Gardiner, J. et al., (1993) Genetics 134: 917). This procedure, however, is not limited to plants 
and can be used for other organisms (such as yeast) or for individual cells. 

The SDFs of the present invention can also be used for simple sequence repeat (SSR) 
mapping. Rice SSR mapping is described by Morgante et al. (The Plant Journal (1993) 3: 165), 
5 Panaud et al. (Genome (1995) 38: 1 170); Senior et al. (Crop Science (1996) 36: 1676), Taramino 
et al. (Genome (1996) 39: 277) and Ahn et al. (Molecular and General Genetics (1993) 241 : 
483-90). SSR mapping can be achieved using various methods. In one instance, polymorphisms 
are identified when sequence specific probes contained within an SDF flanking an SSR are made 
and used in polymerase chain reaction (PCR) assays with template DNA from two or more 
10 individuals of interest. Here, a change in the number of tandem repeats between the SSR- 

flanking sequences produces differently sized fragments (U.S. Patent 5,766,847). Alternatively, 
*f* polymorphisms can be identified by using the PCR fragment produced from the SSR-flanking 
iJl sequence specific primer reaction as a probe against Southern blots representing different 

m 

iy individuals (U.H. Refseth et al., (1997) Electrophoresis 18: 1519). 

VI Genetic and physical maps of crop species have many uses. For example, these maps can 

S3 be used to devise positional cloning strategies for isolating novel genes from the mapped crop 

63 

IPy species. In addition, because the genomes of closely related species are largely syntenic (that is, 
; J they display the same ordering of genes within the genome), these maps can be used to isolate 
y* novel alleles from relatives of crop species by positional cloning strategies. 
20 The various types of maps discussed above can be used with the SDFs of the invention to 

identify Quantitative Trait Loci (QTLs). Many important crop traits, such as the solids content 
of tomatoes, are quantitative traits and result from the combined interactions of several genes. 
These genes reside at different loci in the genome, oftentimes on different chromosomes, and 
generally exhibit multiple alleles at each locus. The SDFs of the invention can be used to 
25 identify QTLs and isolate specific alleles as described by de Vicente and Tanksley (Genetics 
134 :585 (1993)). In addition to isolating QTL alleles in present crop species, the SDFs of the 
invention can also be used to isolate alleles from the corresponding QTL of wild relatives. 
Transgenic plants having various combinations of QTL alleles can then be created and the effects 
of the combinations measured. Once a desired allele combination has been identified, crop 
30 improvement can be accomplished either through biotechnological means or by directed 
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conventional breeding programs (for review see Tanksley and McCouch, Science 277:1063 
(1997)). 



In another embodiment, the SDFs can be used to help create physical maps of the genome 
of corn, Arabidopsis and related species. Where SDFs have been ordered on a genetic map, as 
described above, they can be used as probes to discover which clones in large libraries of plant 
DNA fragments in YACs, BACs, etc. contain the same SDF or similar sequences, thereby 
facilitating the assignment of the large DNA fragments to chromosomal positions. Subsequently, 
the large BACs, YACs, etc. can be ordered unambiguously by more detailed studies of their 
sequence composition (e.g. Marra et al. (1997) Genomic Research 7:1072-1084) and by using 
their end or other sequences to find the identical sequences in other cloned DNA fragments. The 
overlapping of DNA sequences in this way allows large contigs of plant sequences to be built 
that, when sufficiently extended, provide a complete physical map of a chromosome. Sometimes 
the SDFs themselves will provide the means of joining cloned sequences into a contig. 

The patent publication WO95/35505 and U.S. Patents 5,445,943 and 5,410,270 describe 
scanning multiple alleles of a plurality of loci using hybridization to arrays of oligonucleotides. 
These techniques are useful for each of the types of mapping discussed above. 

Following the procedures described above and using a plurality of the SDFs of the 
present invention, any individual can be genotyped. These individual genotypes can be used for 
the identification of particular cultivars, varieties, lines, ecotypes and genetically modified plants 
or can serve as tools for subsequent genetic studies involving multiple phenotypic traits. 

B.3 Southern Blot Hybridization 

The sequences from Reference and Sequence and those encoding polypeptides of Protein 
Group and Protein Group Matrix tables or fragments thereof can be used as probes for various 
hybridization techniques. These techniques are useful for detecting target polynucleotides in a 
sample or for determining whether transgenic plants, seeds or host cells harbor a gene or 
sequence of interest and thus might be expected to exhibit a particular trait or phenotype. 

In addition, the SDFs from the invention can be used to isolate additional members of 
gene families from the same or different species and/or orthologous genes from the same or 
different species. This is accomplished by hybridizing an SDF to, for example, a Southern blot 
containing the appropriate genomic DNA or cDNA. Given the resulting hybridization data, one 
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of ordinary skill in the art could distinguish and isolate the correct DNA fragments by size, 
restriction sites, sequence and stated hybridization conditions from a gel or from a library. 

Identification and isolation of orthologous genes from closely related species and alleles 
within a species is particularly desirable because of their potential for crop improvement. Many 
5 important crop traits, such as the solid content of tomatoes, result from the combined interactions 
of the products of several genes residing at different loci in the genome. Generally, alleles at 
each of these loci can make quantitative differences to the trait. By identifying and isolating 
numerous alleles for each locus from within or different species, transgenic plants with various 
combinations of alleles can be created and the effects of the combinations measured. Once a 
10 more favorable allele combination has been identified, crop improvement can be accomplished 
iJ3 either through biotechnological means or by directed conventional breeding programs (Tanksley 
Q et al. Science 277:1063(1997)). 

Jp The results from hybridizations of the SDFs of the invention to, for example, Southern 

llj blots containing DNA from another species can also be used to generate restriction fragment 
.f 5 maps for the corresponding genomic regions. These maps provide additional information about 
H*? the relative positions of restriction sites within fragments, further distinguishing mapped DNA 
JU from the remainder of the genome. 

M Physical maps can be made by digesting genomic DNA with different combinations of 

^ restriction enzymes. 

20 Probes for Southern blotting to distinguish individual restriction fragments can range in size 

from 15 to 20 nucleotides to several thousand nucleotides. More preferably, the probe is 100 to 
1,000 nucleotides long for identifying members of a gene family when it is found that repetitive 
sequences would complicate the hybridization. For identifying an entire corresponding gene in 
another species, the probe is more preferably the length of the gene, typically 2,000 to 10,000 

25 nucleotides, but probes 50-1,000 nucleotides long might be used. Some genes, however, might 
require probes up to 1,500 nucleotides long or overlapping probes constituting the full-length 
sequence to span their lengths. 

Also, while it is preferred that the probe be homogeneous with respect to its sequence, it 
is not necessary. For example, as described below, a probe representing members of a gene 

30 family having diverse sequences can be generated using PCR to amplify genomic DNA or RNA 
templates using primers derived from SDFs that include sequences that define the gene family. 
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For identifying corresponding genes in another species, the next most preferable probe is 
a cDNA spanning the entire coding sequence, which allows all of the mRNA-coding fragment of 
the gene to be identified. Probes for Southern blotting can easily be generated from SDFs by 
making primers having the sequence at the ends of the SDF and using corn or Arabidopsis 
genomic DNA as a template. In instances where the SDF includes sequence conserved among 
species, primers including the conserved sequence can be used for PCR with genomic DNA from 
a species of interest to obtain a probe. 

Similarly, if the SDF includes a domain of interest, that fragment of the SDF can be used 
to make primers and, with appropriate template DNA, used to make a probe to identify genes 
containing the domain. Alternatively, the PCR products can be resolved, for example by gel 
electrophoresis, and cloned and/or sequenced. Using Southern hybridization, the variants of the 
domain among members of a gene family, both within and across species, can be examined. 

B.4.1 Isolating DNA from Related Organisms 

The SDFs of the invention can be used to isolate the corresponding DNA from other 
organisms. Either cDNA or genomic DNA can be isolated. For isolating genomic DNA, a 
lambda, cosmid, BAC or YAC, or other large insert genomic library from the plant of interest 
can be constructed using standard molecular biology techniques as described in detail by 
Sambrook et al. 1989 (Molecular Cloning: A Laboratory Manual, 2 nd ed. Cold Spring Harbor 
Laboratory Press, New York) and by Ausubel et al. 1992 (Current Protocols in Molecular 
Biology, Greene Publishing, New York). 

To screen a phage library, for example, recombinant lambda clones are plated out on 
appropriate bacterial medium using an appropriate E. coli host strain. The resulting plaques are 
lifted from the plates using nylon or nitrocellulose filters. The plaque lifts are processed through 
denaturation, neutralization, and washing treatments following the standard protocols outlined by 
Ausubel et al. (1992). The plaque lifts are hybridized to either radioactively labeled or non- 
radioactive^ labeled SDF DNA at room temperature for about 16 hours, usually in the presence 
of 50% formamide and 5X SSC (sodium chloride and sodium citrate) buffer and blocking 
reagents. The plaque lifts are then washed at 42°C with 1% Sodium Dodecyl Sulfate (SDS) and 
at a particular concentration of SSC. The SSC concentration used is dependent upon the 
stringency at which hybridization occurred in the initial Southern blot analysis performed. For 
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example, if a fragment hybridized under medium stringency (e.g., Tm - 20°C), then this 
condition is maintained or preferably adjusted to a less stringent condition (e.g., Tm-30°C) to 
wash the plaque lifts. Positive clones show detectable hybridization e.g., by exposure to X-ray 
films or chromogen formation. The positive clones are then subsequently isolated for 
purification using the same general protocol outlined above. Once the clone is purified, 
restriction analysis can be conducted to narrow the region corresponding to the gene of interest. 
The restriction analysis and succeeding subcloning steps can be done using procedures described 
by, for example Sambrook et al. (1989) cited above. 

The procedures outlined for the lambda library are essentially similar to those used for 
YAC library screening, except that the YAC clones are harbored in bacterial colonies. The 
YAC clones are plated out at reasonable density on nitrocellulose or nylon filters supported by 
appropriate bacterial medium in petri plates. Following the growth of the bacterial clones, the 
filters are processed through the denaturation, neutralization, and washing steps following the 
procedures of Ausubel et al. 1992. The same hybridization procedures for lambda library 
screening are followed. 

To isolate cDNA, similar procedures using appropriately modified vectors are employed. 
For instance, the library can be constructed in a lambda vector appropriate for cloning cDNA 
such as >,gtl 1 . Alternatively, the cDNA library can be made in a plasmid vector. cDNA for 
cloning can be prepared by any of the methods known in the art, but is preferably prepared as 
described above. Preferably, a cDNA library will include a high proportion of full-length clones. 

B. 5. Isolating and/or Identifying Orthologous Genes 

Probes and primers of the invention can be used to identify and/or isolate polynucleotides 
related to those in the Reference, Sequence, Protein Group, and Protein Group Matrix tables. 
Related polynucleotides are those that are native to other plant organisms and exhibit either similar 
sequence or encode polypeptides with similar biological activity. One specific example is an 
orthologous gene. Orthologous genes have the same functional activity. As such, orthologous 
genes may be distinguished from homologous genes. The percentage of identity is a function of 
evolutionary separation and, in closely related species, the percentage of identity can be 98 to 
100%. The amino acid sequence of a protein encoded by an orthologous gene can be less than 75% 
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identical, but tends to be at least75% or at least 80% identical, more preferably at least 90%, most 
preferably at least 95% identical to the amino acid sequence of the reference protein. 

To find orthologous genes, the probes are hybridized to nucleic acids from a species of 
interest under low stringency conditions, preferably one where sequences containing as much as 40- 
45% mismatches will be able to hybridize. This condition is established by T m - 40°C to Tm - 48°C 
(see below). Blots are then washed under conditions of increasing stringency. It is preferable that 
the wash stringency be such that sequences that are 85 to 100% identical will hybridize. More 
preferably, sequences 90 to 100% identical will hybridize and most preferably only sequences 
greater than 95% identical will hybridize. One of ordinary skill in the art will recognize that, due to 
degeneracy in the genetic code, amino acid sequences that are identical can be encoded by DNA 
sequences as little as 67% identical or less. Thus, it is preferable, for example, to make an 
overlapping series of shorter probes, on the order of 24 to 45 nucleotides, and individually hybridize 
them to the same arrayed library to avoid the problem of degeneracy introducing large numbers of 
mismatches. 

As evolutionary divergence increases, genome sequences also tend to diverge. Thus, one 
of skill will recognize that searches for orthologous genes between more divergent species will 
require the use of lower stringency conditions compared to searches between closely related 
species. Also, degeneracy of the genetic code is more of a problem for searches in the genome 
of a species more distant evolutionary from the species that is the source of the SDF probe 
sequences. 

Therefore the method described in Bouckaert et al., U.S. Ser. No. 60/121,700 Atty. Dkt. 
No. 2750-1 17P, Client Dkt. No. 00010.001, filed February 25, 1999, hereby incorporated in its 
entirety by reference, can be applied to the SDFs of the present invention to isolate related genes 
from plant species which do not hybridize to the corn Arabidopsis, soybean, rice, wheat, and 
other plant sequences of the reference, Sequence, Protein Group, and Protein Group Matrix 
tables. 

Identification of the relationship of nucleotide or amino acid sequences among plant 
species can be done by comparing the nucleotide or amino acid sequences of SDFs of the present 
application with nucleotide or amino acid sequences of other SDFs such as those present in 
applications listed in the table below: 
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The SDFs of the invention can also be used as probes to search for genes that are related 
to the SDF within a species. Such related genes are typically considered to be members of a 
gene family. In such a case, the sequence similarity will often be concentrated into one or a few 
fragments of the sequence. The fragments of similar sequence that define the gene family 
5 typically encode a fragment of a protein or RNA that has an enzymatic or structural function. 
The percentage of identity in the amino acid sequence of the domain that defines the gene 
family is preferably at least 70%, more preferably 80 to 95%, most preferably 85 to 99%. To 
search for members of a gene family within a species, a low stringency hybridization is usually 
performed, but this will depend upon the size, distribution and degree of sequence divergence of 
10 domains that define the gene family. SDFs encompassing regulatory regions can be used to 

identify coordinately expressed genes by using the regulatory region sequence of the SDF as a 
probe. 



IJl In the instances where the SDFs are identified as being expressed from genes that confer 

a particular phenotype, then the SDFs can also be used as probes to assay plants of different 
¥5 species for those phenotypes. 



si**** 



ijy I.C. Methods to Inhibit Gene Expression 

;P The nucleic acid molecules of the present invention can be used to inhibit gene 

y, transcription and/or translation. Example of such methods include, without limitation: 
20 Antisense Constructs; 

Ribozyme Constructs; 

Chimeraplast Constructs; 

Co- Suppression; 

Transcriptional Silencing; and 
25 Other Methods of Gene Expression. 

C.l Antisense 

In some instances it is desirable to suppress expression of an endogenous or exogenous 
gene. A well-known instance is the FLAVOR-SAVOR™ tomato, in which the gene encoding 
30 ACC synthase is inactivated by an antisense approach, thus delaying softening of the fruit after 
ripening. See for example, U.S. Patent No. 5,859,330; U.S. Patent No. 5,723,766; Oeller, et al, 
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Science, 254:437-439(1991); and Hamilton et al, Nature, 346:284-287 (1990). Also, timing of 
flowering can be controlled by suppression of the FLOWERING LOCUS C (FLC); high levels of 
this transcript are associated with late flowering, while absence of FLC is associated with early 
flowering (S.D. Michaels et al., Plant Cell IT :949 (1999). Also, the transition of apical meristem 
from production of leaves with associated shoots to flowering is regulated by TERMINAL 
FLOWER I, APETALA1 and LEAFY. Thus, when it is desired to induce a transition from shoot 
production to flowering, it is desirable to suppress TFL1 expression (S.J. Liljegren, Plant Cell 
IT: 1007 (1999)). As another instance, arrested ovule development and female sterility result 
from suppression of the ethylene forming enzyme but can be reversed by application of ethylene 
(D. De Martinis et al., Plant Cell H:1061 (1999)). The ability to manipulate female fertility of 
plants is useful in increasing fruit production and creating hybrids. 

In the case of polynucleotides used to inhibit expression of an endogenous gene, the 
introduced sequence need not be perfectly identical to a sequence of the target endogenous gene. 
The introduced polynucleotide sequence will typically be at least substantially identical to the target 
endogenous sequence. 

Some polynucleotide SDFs in the Reference, Sequence, Protein Group, and Protein 
Group Matrix tables represent sequences that are expressed in corn,wheat, rice, soybean 
Arabidopsis and/or other plants. Thus the invention includes using these sequences to generate 
antisense constructs to inhibit translation and/or degradation of transcripts of said SDFs, 
typically in a plant cell. 

To accomplish this, a polynucleotide segment from the desired gene that can hybridize to 
the mRNA expressed from the desired gene (the "antisense segment") is operably linked to a 
promoter such that the antisense strand of RNA will be transcribed when the construct is present in 
a host cell. A regulated promoter can be used in the construct to control transcription of the 
antisense segment so that transcription occurs only under desired circumstances. 

The antisense segment to be introduced generally will be substantially identical to at least a 
fragment of the endogenous gene or genes to be repressed. The sequence, however, need not be 
perfectly identical to inhibit expression. Further, the antisense product may hybridize to the 
untranslated region instead of or in addition to the coding sequence of the gene. The vectors of the 
present invention can be designed such that the inhibitory effect applies to other proteins within a 
family of genes exhibiting homology or substantial homology to the target gene. 

704 



PATENT 

Attorney Docket No. 2750-1481P 
For antisense suppression, the introduced antisense segment sequence also need not be full 
length relative to either the primary transcription product or the fully processed mRNA. Generally, a 
higher percentage of sequence identity can be used to compensate for the use of a shorter sequence. 
Furthermore, the introduced sequence need not have the same intron or exon pattern, and homology 
of non-coding segments may be equally effective. Normally, a sequence of between about 30 or 40 
nucleotides and the full length of the transcript canbe used, though a sequence of at least about 100 
nucleotides is preferred, a sequence of at least about 200 nucleotides is more preferred, and a 
sequence of at least about 500 nucleotides is especially preferred. 

C.2. Ribozymes 

It is also contemplated that gene constructs representing ribozymes and based on the 
SDFs in the Reference and Sequence tables or those encoding polypeptides of the Protein Group 
and Protein Group Matrix tables and fragment thereof are an object of the invention. Ribozymes 
can also be used to inhibit expression of genes by suppressing the translation of the mRNA into a 
polypeptide. It is possible to design ribozymes that specifically pair with virtually any target RNA 
and cleave the phosphodiester backbone at a specific location, thereby functionally inactivating the 
target RNA. In carrying out this cleavage, the ribozyme is not itself altered, and is thus capable of 
recycling and cleaving other molecules, making it a true enzyme. The inclusion of ribozyme 
sequences within antisense RNAs confers RNA-cleaving activity upon them, thereby increasing the 
activity of the constructs. 

A number of classes of ribozymes have been identified. One class of ribozymes is 
derived from a number of small circular RNAs, which are capable of self-cleavage and 
replication in plants. The RNAs replicate either alone (viroid RNAs) or with a helper virus 
(satellite RNAs). Examples include RNAs from avocado sunblotch viroid and the satellite RNAs 
from tobacco ringspot virus, lucerne transient streak virus, velvet tobacco mottle virus, solanum 
nodiflorum mottle virus and subterranean clover mottle virus. The design and use of target 
RNA-specific ribozymes is described in Haseloff et al. Nature, 334 :585 (1988). 

Like the antisense constructs above, the ribozyme sequence fragment necessary for 
pairing need not be identical to the target nucleotides to be cleaved, nor identical to the 
sequences in the Reference and Sequence tables or those encoding polypeptide of the Protein 
Group and Protein Group Matrix tables or fragments thereof. Ribozymes may be constructed by 
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combining the ribozyme sequence and some fragment of the target gene which would allow 
recognition of the target gene mRNA by the resulting ribozyme molecule. Generally, the 
sequence in the ribozyme capable of binding to the target sequence exhibits a percentage of 
sequence identity with at least 80%, preferably with at least 85%, more preferably with at least 
5 90% and most preferably with at least 95%, even more preferably, with at least 96%, 97%, 98% or 
99% sequence identity to some fragment of a sequence in the Reference, Sequence, Protein 
Group, and Protein Group Matrix tables or the complement thereof. The ribozyme can be 
equally effective in inhibiting mRNA translation by cleaving either in the untranslated or coding 
regions. Generally, a higher percentage of sequence identity can be used to compensate for the 

1 0 use of a shorter sequence. Furthermore, the introduced sequence need not have the same intron 

Q 

• o or exon pattern, and homology of non-coding segments may be equally effective. 

U 

tf! C.3. Chimeraplasts 

0i 

iflj The SDFs of the invention, such as those described by Reference, Sequence, Protein 

H5 Group, and Protein Group Matrix tables, can also be used to construct chimeraplasts that can be 

O introduced into a cell to produce at least one specific nucleotide change in a sequence 

m 

corresponding to the SDF of the invention. A chimeraplast is an oligonucleotide comprising 
% DNA and/or RNA that specifically hybridizes to a target region in a manner which creates a 
h* mismatched base-pair. This mismatched base-pair signals the cell's repair enzyme machinery 
20 which acts on the mismatched region resulting in the replacement, insertion or deletion of 

designated nucleotide(s). The altered sequence is then expressed by the cell's normal cellular 
mechanisms. Chimeraplasts can be designed to repair mutant genes, modify genes, introduce 
site-specific mutations, and/or act to interrupt or alter normal gene function (US Pat. Nos. 
6,010,907 and 6,004,804; and PCT Pub. No. W099/58723 and WO99/07865). 

25 

C.4. Sense Suppression 

The SDFs of the reference, Sequence, Protein Group, and Protein Group Matrix tables of 
the present invention are also useful to modulate gene expression by sense suppression. Sense 
suppression represents another method of gene suppression by introducing at least one 
30 exogenous copy or fragment of the endogenous sequence to be suppressed. 
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Introduction of expression cassettes in which a nucleic acid is configured in the sense 
orientation with respect to the promoter into the chromosome of a plant or by a self-replicating virus 
has been shown to be an effective means by which to induce degradation of mRNAs of target genes. 
For an example of the use of this method to modulate expression of endogenous genes see, Napoli 
5 et al., The Plant Cell 2:279 (1990), and U.S. Patents Nos. 5,034,323, 5,231,020, and 5,283,184. 
Inhibition of expression may require some transcription of the introduced sequence. 

For sense suppression, the introduced sequence generally will be substantially identical to 
the endogenous sequence intended to be inactivated. The minimal percentage of sequence identity 
will typically be greater than about 65%, but a higher percentage of sequence identity might exert a 
II) more effective reduction in the level of normal gene products. Sequence identity of more than about 
; 0 80% is preferred, though about 95% to absolute identity would be most preferred. As with 
y antisense regulation, the effect would likely apply to any other proteins within a similar family of 

^ genes exhibiting homology or substantial homology to the suppressing sequence. 

y 

in 
m 

,,15 C.5. Transcriptional Silencing 

j;^ The nucleic acid sequences of the invention, including the SDFs of the reference, 

Hi Sequence, Protein Group, and Protein Group Matrix tables, and fragments thereof, contain 
g4 sequences that can be inserted into the genome of an organism resulting in transcriptional 
^ silencing. Such regulatory sequences need not be operatively linked to coding sequences to 
20 modulate transcription of a gene. Specifically, a promoter sequence without any other element 
of a gene can be introduced into a genome to transcriptionally silence an endogenous gene (see, 
for example, Vaucheret, H et al. (1998) The Plant Journal 16: 651-659). As another example, 
triple helices can be formed using oligonucleotides based on sequences from Reference, 
Sequence, Protein Group, and Protein Group Matrix tables, fragments thereof, and substantially 
25 similar sequence thereto. The oligonucleotide can be delivered to the host cell and can bind to 
the promoter in the genome to form a triple helix and prevent transcription. An oligonucleotide 
of interest is one that can bind to the promoter and block binding of a transcription factor to the 
promoter. In such a case, the oligonucleotide can be complementary to the sequences of the 
promoter that interact with transcription binding factors. 

30 



C.6. Other Methods to Inhibit Gene Expression 
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Yet another means of suppressing gene expression is to insert a polynucleotide into the gene 
of interest to disrupt transcription or translation of the gene. 

Low frequency homologous recombination can be used to target a polynucleotide insert to a 
gene by flanking the polynucleotide insert with sequences that are substantially similar to the gene 
5 to be disrupted. Sequences from Reference, Sequence, Protein Group, and Protein Group Matrix 
tables, fragments thereof, and substantially similar sequence thereto can be used for homologous 
recombination. 

In addition, random insertion of polynucleotides into a host cell genome can also be used 
to disrupt the gene of interest. Azpiroz-Leehan et al., Trends in Genetics 13:152 (1997). In this 
10 method, screening for clones from a library containing random insertions is preferred to 
. ^ identifying those that have polynucleotides inserted into the gene of interest. Such screening can 
h S3 be performed using probes and/or primers described above based on sequences from Reference, 

ipi Sequence, Protein Group, and Protein Group Matrix tables, fragments thereof, and substantially 

Cm 

j similar sequence thereto. The screening can also be performed by selecting clones or Ri plants 

18 having a desired phenotype. 

I.D. Methods of Functional Analysis 

Hp The constructs described in the methods under I.C. above can be used to determine the 

|J function of the polypeptide encoded by the gene that is targeted by the constructs. 

20 Down-regulating the transcription and translation of the targeted gene in the host cell or 

organisms, such as a plant, may produce phenotypic changes as compared to a wild-type cell or 
organism. In addition, in vitro assays can be used to determine if any biological activity, such as 
calcium flux, DNA transcription, nucleotide incorporation, etc., are being modulated by the 
down-regulation of the targeted gene. 

25 Coordinated regulation of sets of genes, e.g., those contributing to a desired polygenic 

trait, is sometimes necessary to obtain a desired phenotype. SDFs of the invention representing 
transcription activation and DNA binding domains can be assembled into hybrid transcriptional 
activators. These hybrid transcriptional activators can be used with their corresponding DNA 
elements (i.e., those bound by the DNA-binding SDFs) to effect coordinated expression of 

30 desired genes (J.J. Schwarz et al., Mol Cell Biol 12:266 (1992), A. Martinez et al., Mol Gen. 
Genet. 261:546(1999)). 
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The SDFs of the invention can also be used in the two-hybrid genetic systems to identify 
networks of protein-protein interactions (L. McAlister-Henn et al., Methods ^9:330 (1999), J.C. 
Hu et al., Methods 20:80 (2000), M. Golovkin et al., J. Biol Chem. 274:36428 (1999), K. 
Ichimura et al., Biochem. Biophys. Res. Comm. 253 :532 (1998)). The SDFs of the invention can 
also be used in various expression display methods to identify important protein-DNA 
interactions (e.g. B. Luo et al., J. Mol Biol. 266:479 (1997)). 

I.E. Promoters 

The SDFs of the invention are also useful as structural or regulatory sequences in a construct 
for modulating the expression of the corresponding gene in a plant or other organism, e.g. a 
symbiotic bacterium. For example, promoter sequences associated to SDFs of the reference, 
Sequence, Protein Group, and Protein Group Matrix tables of the present invention can be useful in 
directing expression of coding sequences either as constitutive promoters or to direct expression in 
particular cell types, tissues, or organs or in response to environmental stimuli. 

With respect to the SDFs of the present invention a promoter is likely to be a relatively small 
portion of a genomic DNA (gDNA) sequence located in the first 2000 nucleotides upstream from an 
initial exon identified in a gDNA sequence or initial "ATG" or methionine codon or translational 
start site in a corresponding cDNA sequence. Such promoters are more likely to be found in the 
first 1000 nucleotides upstream of an initial ATG or methionine codon or translational start site of a 
cDNA sequence corresponding to a gDNA sequence. In particular, the promoter is usually located 
upstream of the transcription start site. The fragments of a particular gDNA sequence that function 
as elements of a promoter in a plant cell will preferably be found to hybridize to gDNA sequences 
presented and described in the Reference table at medium or high stringency, relevant to the length 
of the probe and its base composition. 

Promoters are generally modular in nature. Promoters can consist of a basal promoter that 
functions as a site for assembly of a transcription complex comprising an RNA polymerase, for 
example RNA polymerase II. A typical transcription complex will include additional factors such 
as TFhB, TFnD, and TF M E. Of these, TFnD appears to be the only one to bind DNA directly. The 
promoter might also contain one or more enhancers and/or suppressors that function as binding sites 
for additional transcription factors that have the function of modulating the level of transcription 
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with respect to tissue specificity and of transcriptional responses to particular environmental or 
nutritional factors, and the like. 

Short DNA sequences representing binding sites for proteins can be separated from each 
other by intervening sequences of varying length. For example, within a particular functional 
5 module, protein binding sites may be constituted by regions of 5 to 60, preferably 10 to 30, more 
preferably 10 to 20 nucleotides. Within such binding sites, there are typically 2 to 6 nucleotides that 
specifically contact amino acids of the nucleic acid binding protein. The protein binding sites are 
usually separated from each other by 10 to several hundred nucleotides, typically by 15 to 150 
nucleotides, often by 20 to 50 nucleotides. DNA binding sites in promoter elements often display 
1 0 dyad symmetry in their sequence. Often elements binding several different proteins, and/or a 
■=2? plurality of sites that bind the same protein, will be combined in a region of 50 to 1 ,000 basepairs. 

%S Elements that have transcription regulatory function can be isolated from their 

W 

corresponding endogenous gene, or the desired sequence can be synthesized, and recombined in 
5j ; constructs to direct expression of a coding region of a gene in a desired tissue-specific, temporal- 
IIS specific or other desired manner of inducibility or suppression. When hybridizations are performed 
iH to identify or isolate elements of a promoter by hybridization to the long sequences presented in the 

Reference tables, conditions are adjusted to account for the above-described nature of promoters. 
* :p For example short probes, constituting the element sought, are preferably used under low 
2 temperature and/or high salt conditions. When long probes, which might include several promoter 
20 elements are used, low to medium stringency conditions are preferred when hybridizing to 
promoters across species. 

If a nucleotide sequence of an SDF, or part of the SDF, functions as a promoter or 
fragment of a promoter, then nucleotide substitutions, insertions or deletions that do not 
substantially affect the binding of relevant DNA binding proteins would be considered 
25 equivalent to the exemplified nucleotide sequence. It is envisioned that there are instances where 
it is desirable to decrease the binding of relevant DNA binding proteins to silence or down- 
regulate a promoter, or conversely to increase the binding of relevant DNA binding proteins to 
enhance or up-regulate a promoter and vice versa. In such instances, polynucleotides 
representing changes to the nucleotide sequence of the DNA-protein contact region by insertion 
30 of additional nucleotides, changes to identity of relevant nucleotides, including use of 

chemically-modified bases, or deletion of one or more nucleotides are considered encompassed 
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by the present invention. In addition, fragments of the promoter sequences described by 
Reference tables and variants thereof can be fused with other promoters or fragments to facilitate 
transcription and/or transcription in specific type of cells or under specific conditions. 

Promoter function can be assayed by methods known in the art, preferably by measuring 
5 activity of a reporter gene operatively linked to the sequence being tested for promoter function. 
Examples of reporter genes include those encoding luciferase, green fluorescent protein, GUS, 
neo, cat and bar. 

LF. UTRs and Junctions 
1 0 Polynucleotides comprising untranslated (UTR) sequences and intron/exon junctions are 

also within the scope of the invention. UTR sequences include introns and 5' or 3' untranslated 
regions ( 5' UTRs or 3' UTRs). Fragments of the sequences shown in the Reference and Sequence 
iji tables can comprise UTRs and intron/exon junctions. 
^ J These fragments of SDFs, especially UTRs, can have regulatory functions related to, for 

if example, translation rate and mRNA stability. Thus, these fragments of SDFs can be isolated for 
p use as elements of gene constructs for regulated production of polynucleotides encoding desired 
^ polypeptides. 

: P Introns of genomic DNA segments might also have regulatory functions. Sometimes 

2 regulatory elements, especially transcription enhancer or suppressor elements, are found within 
20 introns. Also, elements related to stability of heteronuclear RNA and efficiency of splicing and 
of transport to the cytoplasm for translation can be found in intron elements. Thus, these 
segments can also find use as elements of expression vectors intended for use to transform 
plants. 

Just as with promoters UTR sequences and intron/exon junctions can vary from those 
25 shown in the Reference and Sequence tables. Such changes from those sequences preferably 
will not affect the regulatory activity of the UTRs or intron/exon junction sequences on 
expression, transcription, or translation unless selected to do so. However, in some instances, 
down- or up-regulation of such activity may be desired to modulate traits or phenotypic or in 
vitro activity. 

30 



I.G. Coding Sequences 
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Isolated polynucleotides of the invention can include coding sequences that encode 
polypeptides comprising an amino acid sequence encoded by sequences described in the 
Reference and Sequence tables or an amino acid sequence presented in the Reference, Sequence, 
Protein Group, and Protein Group Matrix tables. 
5 A nucleotide sequence encodes a polypeptide if a cell (or a cell free in vitro system) 

expressing that nucleotide sequence produces a polypeptide having the recited amino acid 
sequence when the nucleotide sequence is transcribed and the primary transcript is subsequently 
processed and translated by a host cell (or a cell free in vitro system) harboring the nucleic acid. 
Thus, an isolated nucleic acid that encodes a particular amino acid sequence can be a genomic 
10 sequence comprising exons and introns or a cDNA sequence that represents the product of 
^ splicing thereof An isolated nucleic acid encoding an amino acid sequence also encompasses 

k -8 heteronuclear RNA, which contains sequences that are spliced out during expression, and 

W 

iiji mRNA, which lacks those sequences. 

Coding sequences can be constructed using chemical synthesis techniques or by isolating 
$5 coding sequences or by modifying such synthesized or isolated coding sequences as described 
m above. 

:jy In addition to coding sequences encoding the polypeptide sequences of the reference, 

Sequence, Protein Group, and Protein Group Matrix tables, which are native to corn, 

jij Arabidopsis, soybean, rice, wheat, and other plants, the isolated polynucleotides can be 

20 polynucleotides that encode variants, fragments, and fusions of those native proteins. Such 
polypeptides are described below in part II. 

In variant polynucleotides generally, the number of substitutions, deletions or insertions is 
preferably less than 20%, more preferably less than 15%; even more preferably less than 10%, 5%, 
3% or 1% of the number of nucleotides comprising a particularly exemplified sequence. It is 

25 generally expected that non-degenerate nucleotide sequence changes that result in 1 to 10, more 

preferably 1 to 5 and most preferably 1 to 3 amino acid insertions, deletions or substitutions will not 
greatly affect the function of an encoded polypeptide. The most preferred embodiments are those 
wherein 1 to 20, preferably 1 to 10, most preferably 1 to 5 nucleotides are added to, or deleted from 
and/or substituted in the sequences specifically disclosed in the Reference and Sequence tables or 

30 polynucleotides that encode polypeptides of the Protein Group, and Protein Group Matrix tables 
or fragments thereof. 
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Insertions or deletions in polynucleotides intended to be used for encoding a polypeptide 
preferably preserve the reading frame. This consideration is not so important in instances when 
the polynucleotide is intended to be used as a hybridization probe. 



5 II. Polypeptides and Proteins 

II A. Native polypeptides and proteins 

Polypeptides within the scope of the invention include both native proteins as well as 
variants, fragments, and fusions thereof. Polypeptides of the invention are those encoded by any 
of the six reading frames of sequences shown in the Reference and Sequence tables, preferably 
10 encoded by the three frames reading in the 5' to 3' direction of the sequences as shown. 
1% Native polypeptides include the proteins encoded by the sequences shown in the 

Reference and Sequence tables. Such native polypeptides include those encoded by allelic 



yi variants. 

^ j Polypeptide and protein variants will exhibit at least 75% sequence identity to those native 

polypeptides of the Reference and Sequence tables. More preferably, the polypeptide variants will 

q exhibit at least 85% sequence identity; even more preferably, at least 90% sequence identity; more 
preferably at least 95%, 96%, 97%, 98%, or 99% sequence identity. Fragments of polypeptide or 

i{ Is? 

fragments of polypeptides will exhibit similar percentages of sequence identity to the relevant 
fragments of the native polypeptide. Fusions will exhibit a similar percentage of sequence identity 
20 in that fragment of the fusion represented by the variant of the native peptide. 

Polypeptide and protein variants of the invention will exhibit at least 75% sequence identity 
to those motifs or consensus sequences of the Protein Group and Protein Group Matrix tables. 
More preferably, the polypeptide variants will exhibit at least 85% sequence identity; even more 
preferably, at least 90% sequence identity; more preferably at least 95%, 96%, 97%, 98%, or 99% 
25 sequence identity. Fragments of polypeptide or fragments of polypeptides will exhibit similar 
percentages of sequence identity to the relevant fragments of the native polypeptide that are 
indicated in the Protein Group table. Fusions will exhibit a similar percentage of sequence identity 
in that fragment of the fusion represented by the variant of the native peptide. 



30 



Furthermore, polypeptide variants will exhibit at least one of the functional properties of the 
native protein. Such properties include, without limitation, protein interaction, DNA interaction, 
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biological activity, immunological activity, receptor binding, signal transduction, transcription 
activity, growth factor activity, secondary structure, three-dimensional structure, etc. As to 
properties related to in vitro or in vivo activities, the variants preferably exhibit at least 60% of the 
activity of the native protein; more preferably at least 70%, even more preferably at least 80%, 85%, 
90% or 95% of at least one activity of the native protein. 

One type of variant of native polypeptides comprises amino acid substitutions, deletions 
and/or insertions. Conservative substitutions are preferred to maintain the function or activity of the 
polypeptide. 

Within the scope of percentage of sequence identity described above, a polypeptide of the 
invention may have additional individual amino acids or amino acid sequences inserted into the 
polypeptide in the middle thereof and/or at the N-terminal and/or C-terminal ends thereof. 
Likewise, some of the amino acids or amino acid sequences may be deleted from the 
polypeptide. 

A.l Antibodies 

Isolated polypeptides can be utilized to produce antibodies. Polypeptides of the invention 
can generally be used, for example, as antigens for raising antibodies by known techniques. The 
resulting antibodies are useful as reagents for determining the distribution of the antigen protein 
within the tissues of a plant or within a cell of a plant. The antibodies are also useful for examining 
the production level of proteins in various tissues, for example in a wild-type plant or following 
genetic manipulation of a plant, by methods such as Western blotting. 

Antibodies of the present invention, both polyclonal and monoclonal, may be prepared by 
conventional methods. In general, the polypeptides of the invention are first used to immunize a 
suitable animal, such as a mouse, rat, rabbit, or goat. Rabbits and goats are preferred for the 
preparation of polyclonal sera due to the volume of serum obtainable, and the availability of labeled 
anti-rabbit and anti-goat antibodies as detection reagents. Immunization is generally performed by 
mixing or emulsifying the protein in saline, preferably in an adjuvant such as Freund's complete 
adjuvant, and injecting the mixture or emulsion parenterally (generally subcutaneously or 
intramuscularly). A dose of 50-200 jag/injection is typically sufficient. Immunization is generally 
boosted 2-6 weeks later with one or more injections of the protein in saline, preferably using 
Freund's incomplete adjuvant. One may alternatively generate antibodies by in vitro immunization 
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using methods known in the art, which for the purposes of this invention is considered equivalent to 
in vivo immunization. 

Polyclonal antisera is obtained by bleeding the immunized animal into a glass or plastic 
container, incubating the blood at 25 °C for one hour, followed by incubating the blood at 4°C for 2- 
5 18 hours. The serum is recovered by centrifugation (e.g., l,000xg for 10 minutes). About 20-50 ml 
per bleed may be obtained from rabbits. 

Monoclonal antibodies are prepared using the method of Kohler and Milstein, Nature 256: 
495 (1975), or modification thereof. Typically, a mouse or rat is immunized as described above. 
However, rather than bleeding the animal to extract serum, the spleen (and optionally several large 
10 lymph nodes) is removed and dissociated into single cells. If desired, the spleen cells can be 
?^ screened (after removal of nonspecifically adherent cells) by applying a cell suspension to a plate, or 
H & well, coated with the protein antigen. B-cells producing membrane-bound immunoglobulin specific 

y 

for the antigen bind to the plate, and are not rinsed away with the rest of the suspension. Resulting 
B-cells, or all dissociated spleen cells, are then induced to fuse with myeloma cells to form 
hybridomas, and are cultured in a selective medium (e.g., hypoxanthine, aminopterin, thymidine 
ip medium, "HAT"). The resulting hybridomas are plated by limiting dilution, and are assayed for the 
^ production of antibodies which bind specifically to the immunizing antigen (and which do not bind 
: P to unrelated antigens). The selected Mab-secreting hybridomas are then cultured either in vitro 
m (e.g. , in tissue culture bottles or hollow fiber reactors), or in vivo (as ascites in mice). 
20 Other methods for sustaining antibody-producing B-cell clones, such as by EBV 

transformation, are known. 

If desired, the antibodies (whether polyclonal or monoclonal) may be labeled using 
conventional techniques. Suitable labels include fluorophores, chromophores, radioactive atoms 
(particularly 32 P and 125 I), electron-dense reagents, enzymes, and ligands having specific binding 
25 partners. Enzymes are typically detected by their activity. For example, horseradish peroxidase is 
usually detected by its ability to convert 3,3\5,5 5 -tetramethylbenzidine (TNB) to a blue pigment, 
quantifiable with a spectrophotometer. 

A.2 In Vitro Applications of Polypeptides 
30 Some polypeptides of the invention will have enzymatic activities that are useful in vitro. 

For example, the soybean trypsin inhibitor (Kunitz) family is one of the numerous families of 
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proteinase inhibitors. It comprises plant proteins which have inhibitory activity against serine 
proteinases from the trypsin and subtilisin families, thiol proteinases and aspartic proteinases. 
Thus, these peptides find in vitro use in protein purification protocols and perhaps in therapeutic 
settings requiring topical application of protease inhibitors. 

5 Delta-aminolevulinic acid dehydratase (EC 4.2.1.24 ) (ALAD) catalyzes the second step 

in the biosynthesis of heme, the condensation of two molecules of 5-aminolevulinate to form 
porphobilinogen and is also involved in chlorophyll biosynthesis(Kaczor et al. (1994) Plant 
Physiol. 1-4: 1411-7; Smith (1988) Biochem. J. 249: 423-8; Schneider (1976) Z. naturforsch. [C] 
31 : 55-63). Thus, ALAD proteins can be used as catalysts in synthesis of heme derivatives. 
l§ Enzymes of biosynthetic pathways generally can be used as catalysts for in vitro synthesis of the 
compounds representing products of the pathway. 



: ?*f Polypeptides encoded by SDFs of the invention can be engineered to provide 

yi 

liji purification reagents to identify and purify additional polypeptides that bind to them. This 

in 1 

jj~ allows one to identify proteins that function as multimers or elucidate signal transduction or 
|5 metabolic pathways. In the case of DNA binding proteins, the polypeptide can be used in a 
$5 similar manner to identify the DNA determinants of specific binding (S. Pierrou et al., Anal. 

Biochem. 229:99 (1995), S. Chusacultanachai et al., J. Biol. Chem. 274:23591 (1999), Q. Lin et 
Q al., J. Biol. Chem. 272:27274(1997)). 

20 II.B. POLYPEPTIDE VARIANTS , FRAGMENTS, AND FUSIONS 

Generally, variants , fragments, or fusions of the polypeptides encoded by the maximum 
length sequence(MLS) can exhibit at least one of the activities of the identified domains and/or 
related polypeptides described in Sections (C) and (D) of The Reference tables corresponding to 
the MLS of interest. 

25 

II.B .(1) Variants 

A type of variant of the native polypeptides comprises amino acid substitutions. 

Conservative substitutions, described above (see II.), are preferred to maintain the function or 

activity of the polypeptide. Such substitutions include conservation of charge, polarity, 

30 hydrophobicity, size, etc. For example, one or more amino acid residues within the sequence can be 

substituted with another amino acid of similar polarity that acts as a functional equivalent, for 
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example providing a hydrogen bond in an enzymatic catalysis. Substitutes for an amino acid within 
an exemplified sequence are preferably made among the members of the class to which the amino 
acid belongs. For example, the nonpolar (hydrophobic) amino acids include alanine, leucine, 
isoleucine, valine, proline, phenylalanine, tryptophan and methionine. The polar neutral amino 
5 acids include glycine, serine, threonine, cysteine, tyrosine, asparagine, and glutamine. The 
positively charged (basic) amino acids include arginine, lysine and histidine. The negatively 
charged (acidic) amino acids include aspartic acid and glutamic acid. 

Within the scope of percentage of sequence identity described above, a polypeptide of the 
invention may have additional individual amino acids or amino acid sequences inserted into the 
10 polypeptide in the middle thereof and/or at the N-terminal and/or C-terminal ends thereof. Likewise, 
some of the amino acids or amino acid sequences may be deleted from the polypeptide. Amino acid 
^fl substitutions may also be made in the sequences; conservative substitutions being preferred. 

W 

|fj One preferred class of variants are those that comprise (1) the domain of an encoded 

m 

^ I polypeptide and/or (2) residues conserved between the encoded polypeptide and related 

45 polypeptides. For this class of variants, the encoded polypeptide sequence is changed by 

ip insertion, deletion, or substitution at positions flanking the domain and/or conserved residues. 

Another class of variants includes those that comprise an encoded polypeptide sequence 



= t that is changed in the domain or conserved residues by a conservative substitution. 

o 

Yet another class of variants includes those that lack one of the in vitro activities, or 
20 structural features of the encoded polypeptides. One example is polypeptides or proteins 

produced from genes comprising dominant negative mutations. Such a variant may comprise an 
encoded polypeptide sequence with non-conservative changes in a particular domain or group of 
conserved residues. 

25 II.A.(2) FRAGMENTS 

Fragments of particular interest are those that comprise a domain identified for a 
polypeptide encoded by an MLS of the instant invention and variants thereof. Also, fragments that 
comprise at least one region of residues conserved between an MLS encoded polypeptide and its 
related polypeptides are of great interest. Fragments are sometimes useful as polypeptides 

30 corresponding to genes comprising dominant negative mutations are. 
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ILA.(3) FUSIONS 
Of interest are chimeras comprising (1) a fragment of the MLS encoded polypeptide or 
variants thereof of interest and (2) a fragment of a polypeptide comprising the same domain. For 
example, an AP2 helix encoded by a MLS of the invention fused to second AP2 helix from ANT 
protein, which comprises two AP2 helices. The present invention also encompasses fusions of 
MLS encoded polypeptides, variants, or fragments thereof fused with related proteins or 
fragments thereof. 

DEFINITION OF DOMAINS 

The polypeptides of the invention may possess identifying domains as shown in The 
Reference tables. Specific domains within the MLS encoded polypeptides are indicated in The 
Reference tables. In addition, the domains within the MLS encoded polypeptide can be defined 
by the region that exhibits at least 70% sequence identity with the consensus sequences listed in 
the detailed description below of each of the domains. 

The majority of the protein domain descriptions given in the protein domain table are 
obtained from Prosite, (http//www.expasy.ch/prosite/), and Pfam, 

(http//pfam.wustl.edu/browse.shtml). Examples of domain descriptions are listed in the Protein 
Domain table. 

A. Activities of Polypeptides Comprising Signal Peptides 
Polypeptides comprising signal peptides are a family of proteins that are typically 
targeted to (1) a particular organelle or intracellular compartment, (2) interact with a particular 
molecule or (3) for secretion outside of a host cell. Example of polypeptides comprising signal 
peptides include, without limitation, secreted proteins, soluble proteins, receptors, proteins 
retained in the ER, etc. 

These proteins comprising signal peptides are useful to modulate ligand-receptor 
interactions, cell-to-cell communication, signal transduction, intracellular communication, and 
activities and/or chemical cascades that take part in an organism outside or within of any 
particular cell. 
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One class of such proteins are soluble proteins which are transported out of the cell. 
These proteins can act as ligands that bind to receptor to trigger signal transduction or to permit 
communication between cells. 

Another class is receptor proteins which also comprise a retention domain that lodges the 
receptor protein in the membrane when the cell transports the receptor to the surface of the cell. 
Like the soluble ligands, receptors can also modulate signal transduction and communication 
between cells. 

=3 I* 1 addition the signal peptide itself can serve as a ligand for some receptors. An example 

;;0 is the interaction of the ER targeting signal peptide with the signal recognition particle (SRP). 
Ill Here, the SRP binds to the signal peptide, halting translation, and the resulting SRP complex 



10 



3 



ijy then binds to docking proteins located on the surface of the ER, prompting transfer of the protein 
% into the ER. 



l^l A description of signal peptide residue composition is described below in Subsection 

*P IV.C.l. 

m 

20 HI. Methods of Modulating Polypeptide Production 

It is contemplated that polynucleotides of the invention can be incorporated into a host 
cell or in-vitro system to modulate polypeptide production. For instance, the SDFs prepared as 
described herein can be used to prepare expression cassettes useful in a number of techniques for 
suppressing or enhancing expression. 

25 An example are polynucleotides comprising sequences to be transcribed, such as coding 

sequences, of the present invention can be inserted into nucleic acid constructs to modulate 
polypeptide production. Typically, such sequences to be transcribed are heterologous to at least 
one element of the nucleic acid construct to generate a chimeric gene or construct. 

Another example of useful polynucleotides are nucleic acid molecules comprising 

30 regulatory sequences of the present invention. Chimeric genes or constructs can be generated 
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when the regulatory sequences of the invention linked to heterologous sequences in a vector 
construct. Within the scope of invention are such chimeric gene and/or constructs. 

Also within the scope of the invention are nucleic acid molecules, whereof at least a part or 
fragment of these DNA molecules are presented in the Reference and Sequence tables or 
5 polynucleotide encoding polypeptides of the Protein Group or Protein Group Matrix tables of the 
present application, and wherein the coding sequence is under the control of its own promoter 
and/or its own regulatory elements. Such molecules are useful for transforming the genome of a 
host cell or an organism regenerated from said host cell for modulating polypeptide production. 

Additionally, a vector capable of producing the oligonucleotide can be inserted into the host 
1 0 cell to deliver the oligonucleotide. 

More detailed description of components to be included in vector constructs are described 
both above and below. 



Whether the chimeric vectors or native nucleic acids are utilized, such polynucleotides 

m 



in 



m 



can be incorporated into a host cell to modulate polypeptide production. Native genes and/or 
\XB nucleic acid molecules can be effective when exogenous to the host cell. 
m Methods of modulating polypeptide expression includes, without limitation: 

Suppression methods, such as 
Antisense 
Ribozymes 
20 Co-suppression 

Insertion of Sequences into the Gene to be Modulated 
Regulatory Sequence Modulation. 

as well as Methods for Enhancing Production, such as 
25 Insertion of Exogenous Sequences; and 

Regulatory Sequence Modulation. 

III.A. Suppression 

Expression cassettes of the invention can be used to suppress expression of endogenous 
30 genes which comprise the SDF sequence. Inhibiting expression can be useful, for instance, to 
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tailor the ripening characteristics of a fruit (Oeller et al., Science 254 :437 (1991)) or to influence 
seed size_(WO98/07842) or to provoke cell ablation (Mariani et al. 5 Nature 357: 384-387 (1992). 

As described above, a number of methods can be used to inhibit gene expression in 
plants, such as antisense, ribozyme, introduction of exogenous genes into a host cell, insertion of 
5 a polynucleotide sequence into the coding sequence and/or the promoter of the endogenous gene 
of interest, and the like. 

HI. A. 1. Antisense 

An expression cassette as described above can be transformed into host cell or plant to 
10 produce an antisense strand of RNA. For plant cells, antisense RNA inhibits gene expression by 
^ preventing the accumulation of mRNA which encodes the enzyme of interest, see, e.g., Sheehy 

« et al., Proc. Nat. Acad. Sci. USA, 85:8805 (1988), and Hiatt et al., U.S. Patent No. 4,801,340. 

W 

m 

^ I IIIA.2. Ribozymes 

$5 Similarly, ribozyme constructs can be transformed into a plant to cleave mRNA and 

down-regulate translation. 



ru 

P III. A.3 . Co- Suppression 

Another method of suppression is by introducing an exogenous copy of the gene to be 
20 suppressed. Introduction of expression cassettes in which a nucleic acid is configured in the 

sense orientation with respect to the promoter has been shown to prevent the accumulation of 

mRNA. A detailed description of this method is described above. 

III.AA Insertion of Sequences into the Gene to be Modulated 

25 Yet another means of suppressing gene expression is to insert a polynucleotide into the gene 

of interest to disrupt transcription or translation of the gene. 

Homologous recombination could be used to target a polynucleotide insert to a gene using 
the Cre-Lox system (A.C. Vergunst et al., Nucleic Acids Res. 26:2729 (1998), A.C. Vergunst et al., 
Plant MoL Biol 38:393 (1998), H. Albert et al., Plant J. 7:649 (1995)). 
30 In addition, random insertion of polynucleotides into a host cell genome can also be used 

to disrupt the gene of interest. Azpiroz-Leehan et al., Trends in Genetics 13: 1 52 (1 997). In this 
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method, screening for clones from a library containing random insertions is preferred for 
identifying those that have polynucleotides inserted into the gene of interest. Such screening can 
be performed using probes and/or primers described above based on sequences from the 
Reference and Sequence tables or polynucleotides encoding polypeptides of the Protein Group or 
Protein Group Matrix tables, fragments thereof, and substantially similar sequence thereto. The 
screening can also be performed by selecting clones or any transgenic plants having a desired 
phenotype. 

III.A.5. Regulatory SequenceModulation 
The SDFs described in the Reference and Sequence tables or polynucleotides encoding 
polypeptides of the Protein Group or Protein Group Matrix tables, and fragments thereof are 
examples of nucleotides of the invention that contain regulatory sequences that can be used to 
suppress or inactivate transcription and/or translation from a gene of interest as discussed in I.C.5. 

III.A.6. Genes Comprising Dominant-Negative Mutations 
When suppression of production of the endogenous, native protein is desired it is often 
helpful to express a gene comprising a dominant negative mutation. Production of protein 
variants produced from genes comprising dominant negative mutations is a useful tool for 
research Genes comprising dominant negative mutations can produce a variant polypeptide which 
is capable of competing with the native polypeptide, but which does not produce the native result. 
Consequently, over expression of genes comprising these mutations can titrate out an undesired 
activity of the native protein. For example, The product from a gene comprising a dominant 
negative mutation of a receptor can be used to constitutively activate or suppress a signal 
transduction cascade, allowing examination of the phenotype and thus the trait(s) controlled by 
that receptor and pathway. Alternatively, the protein arising from the gene comprising a dominant- 
negative mutation can be an inactive enzyme still capable of binding to the same substrate as the 
native protein and therefore competes with such native protein. 

Products from genes comprising dominant-negative mutations can also act upon the native 
protein itself to prevent activity. For example, the native protein may be active only as a homo- 
multimer or as one subunit of a hetero-multimer. Incorporation of an inactive subunit into the 
multimer with native subunit(s) can inhibit activity. 
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Thus, gene function can be modulated in host cells of interest by insertion into these cells 
vector constructs comprising a gene comprising a dominant-negative mutation. 

III.B. Enhanced Expression 

Enhanced expression of a gene of interest in a host cell can be accomplished by either (1) 
insertion of an exogenous gene; or (2) promoter modulation. 

III.B. 1 . Insertion of an Exogenous Gene 

Insertion of an expression construct encoding an exogenous gene can boost the number of 
gene copies expressed in a host cell. 

Such expression constructs can comprise genes that either encode the native protein that is 
of interest or that encode a variant that exhibits enhanced activity as compared to the native protein. 
Such genes encoding proteins of interest can be constructed from the sequences from the Reference 
and Sequence tables or polynucleotides encoding polypeptides of the Protein Group or Protein 
Group Matrix tables, fragments thereof, and substantially similar sequence thereto. 

Such an exogenous gene can include either a constitutive promoter permitting expression 
in any cell in a host organism or a promoter that directs transcription only in particular cells or 
times during a host cell life cycle or in response to environmental stimuli. 

III.B.2. Regulatory Sequence Modulation 

The SDFs of the Reference and Sequence tables, and fragments thereof, contain regulatory 
sequences that can be used to enhance expression of a gene of interest. For example, some of these 
sequences contain useful enhancer elements. In some cases, duplication of enhancer elements or 
insertion of exogenous enhancer elements will increase expression of a desired gene from a 
particular promoter. As other examples, all 11 promoters require binding of a regulatory protein to 
be activated, while some promoters may need a protein that signals a promoter binding protein to 
expose a polymerase binding site. In either case, over-production of such proteins can be used to 
enhance expression of a gene of interest by increasing the activation time of the promoter. 

Such regulatory proteins are encoded by some of the sequences in the Reference and 
Sequence tables or polynucleotides encoding polypeptides of the Protein Group or Protein Group 
Matrix tables, fragments thereof, and substantially similar sequences thereto. 
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Coding sequences for these proteins can be constructed as described above. 

IV. Gene Constructs and Vector Construction 

To use isolated SDFs of the present invention or a combination of them or parts and/or 
5 mutants and/or fusions of said SDFs in the above techniques, recombinant DNA vectors which 
comprise said SDFs and are suitable for transformation of cells, such as plant cells, are usually 
prepared. The SDF construct can be made using standard recombinant DNA techniques 
(Sambrook et al. 1989) and can be introduced to the species of interest by Agrobacterium- 
mediated transformation or by other means of transformation (e.g., particle gun bombardment) as 
1 0 referenced below. 

!Jft The vector backbone can be any of those typical in the art such as plasmids, viruses, 

p artificial chromosomes, BACs, YACs and PACs and vectors of the sort described by 

W J 

IJl (a) BAC: Shizuya et al., Proc. Natl. Acad. Sci. USA 89: 8794-8797 (1992); 

m 

Hamilton et al., Proc. Natl. Acad. Sci. USA 93: 9975-9979 (1996); 
ft (b) YAC: Burke etaL, Science 236:806-812 (1987);. 

(c) PAC: Sternberg N. et al., Proc Natl Acad Sci USA. Jan;87(l): 103-7 (1990); 

(d) Bacteria- Yeast Shuttle Vectors: Bradshaw et al., Nucl Acids Res 23: 4850- 



03 

ru 

P 4856 (1995); 



[M* (e) Lambda Phage Vectors: Replacement Vector, e.g., 

20 Frischauf et al., J. Mol Biol 170: 827-842 (1983); or Insertion vector, e.g., 

Huynh et al., In: Glover NM (ed) DNA Cloning: A practical Approach, Vol.1 Oxford: IRL Press 

(1985); 

(f) T-DNA gene fusion vectors :Walden et al., Mol Cell Biol 1 : 175-194 (1990); and 

(g) Plasmid vectors: Sambrook et al., infra. 

25 Typically, a vector will comprise the exogenous gene, which in its turn comprises an SDF of 

the present invention to be introduced into the genome of a host cell, and which gene may be an 
antisense construct, a ribozyme construct chimeraplast, or a coding sequence with any desired 
transcriptional and/or translational regulatory sequences, such as promoters, UTRs, and 3' end 
termination sequences. Vectors of the invention can also include origins of replication, scaffold 

30 attachment regions (SARs), markers, homologous sequences, introns, etc. 
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A DNA sequence coding for the desired polypeptide, for example a cDNA sequence 
encoding a full length protein, will preferably be combined with transcriptional and translational 
initiation regulatory sequences which will direct the transcription of the sequence from the gene in 
the intended tissues of the transformed plant. 
5 For example, for over-expression, a plant promoter fragment may be employed that will 

direct transcription of the gene in all tissues of a regenerated plant. Alternatively, the plant 
promoter may direct transcription of an SDF of the invention in a specific tissue (tissue-specific 
promoters) or may be otherwise under more precise environmental control (inducible promoters). 
If proper polypeptide productionis desired, a polyadenylation region at the 3 '-end of the 
1 0 coding region is typically included. The polyadenylation region can be derived from the natural 
2 gene, from a variety of other plant genes, or from T-DNA. 

jj*; The vector comprising the sequences from genes or SDF or the invention may comprise a 

Wl marker gene that confers a selectable phenotype on plant cells. The vector can include promoter 
IIU and coding sequence, for instance. For example, the marker may encode biocide resistance, 
H (§ particularly antibiotic resistance, such as resistance to kanamycin, G418, bleomycin, hygromycin, or 
O herbicide resistance, such as resistance to chlorosulfuron or phosphinotricin. 

m 
iu 

: ^ IV.A. Coding Sequences 

Q 

y* Generally, the sequence in the transformation vector and to be introduced into the 

20 genome of the host cell does not need to be absolutely identical to an SDF of the present 
invention. Also, it is not necessary for it to be full length, relative to either the primary 
transcription product or fully processed mRNA. Furthermore, the introduced sequence need not 
have the same intron or exon pattern as a native gene. Also, heterologous non-coding segments 
can be incorporated into the coding sequence without changing the desired amino acid sequence 
25 of the polypeptide to be produced. 

IV.B. Promoters 

As explained above, introducing an exogenous SDF from the same species or an 
orthologous SDF from another species are useful to modulate the expression of a native gene 
30 corresponding to that SDF of interest. Such an SDF construct can be under the control of either 
a constitutive promoter or a highly regulated inducible promoter (e.g., a copper inducible 
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promoter). The promoter of interest can initially be either endogenous or heterologous to the 
species in question. When re-introduced into the genome of said species, such promoter becomes 
exogenous to said species. Over-expression of an SDF transgene can lead to co-suppression of 
the homologous endogeneous sequence thereby creating some alterations in the phenotypes of 
the transformed species as demonstrated by similar analysis of the chalcone synthase gene 
(Napoli et al., Plant Cell 2:279 (1990) and van der Krol et al., Plant Cell 2:291 (1990)). If an 
SDF is found to encode a protein with desirable characteristics, its over-production can be 
controlled so that its accumulation can be manipulated in an organ- or tissue-specific manner 
utilizing a promoter having such specificity. 

Likewise, if the promoter of an SDF (or an SDF that includes a promoter) is found to be 
tissue-specific or developmentally regulated, such a promoter can be utilized to drive or facilitate 
the transcription of a specific gene of interest {e.g., seed storage protein or root-specific protein). 
Thus, the level of accumulation of a particular protein can be manipulated or its spatial 
localization in an organ- or tissue- specific manner can be altered. 

IV. C Signal Peptides 

SDFs of the present invention containing signal peptides are indicated in the Reference, 
Sequence, the Protein Group and Protein Group Matrix tables. In some cases it may be desirable 
for the protein encoded by an introduced exogenous or orthologous SDF to be targeted (1) to a 
particular organelle intracellular compartment, (2) to interact with a particular molecule such as a 
membrane molecule or (3) for secretion outside of the cell harboring the introduced SDF. This 
will be accomplished using a signal peptide. 

Signal peptides direct protein targeting, are involved in ligand-receptor interactions and 
act in cell to cell communication. Many proteins, especially soluble proteins, contain a signal 
peptide that targets the protein to one of several different intracellular compartments. In plants, 
these compartments include, but are not limited to, the endoplasmic reticulum (ER), 
mitochondria, plastids (such as chloroplasts), the vacuole, the Golgi apparatus, protein storage 
vessicles (PSV) and, in general, membranes. Some signal peptide sequences are conserved, such 
as the Asn-Pro-Ile-Arg amino acid motif found in the N-terminal propeptide signal that targets 
proteins to the vacuole (Marty (1999) The Plant Cell 1 1 : 587-599). Other signal peptides do not 
have a consensus sequence per se, but are largely composed of hydrophobic amino acids, such as 
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those signal peptides targeting proteins to the ER (Vitale and Denecke (1999) The Plant Cell 1 1 : 
615-628). Still others do not appear to contain either a consensus sequence or an identified 
common secondary sequence, for instance the chloroplast stromal targeting signal peptides 
(Keegstra and Cline (1999) The Plant Cell 1 1 : 557-570). Furthermore, some targeting peptides 
are bipartite, directing proteins first to an organelle and then to a membrane within the organelle 
(e.g. within the thylakoid lumen of the chloroplast; see Keegstra and Cline (1999) The Plant Cell 
1 1 : 557-570). In addition to the diversity in sequence and secondary structure, placement of the 
signal peptide is also varied. Proteins destined for the vacuole, for example, have targeting 
signal peptides found at the N-terminus, at the C-terminus and at a surface location in mature, 
folded proteins. Signal peptides also serve as ligands for some receptors. 

These characteristics of signal proteins can be used to more tightly control the phenotypic 
expression of introduced SDFs. In particular, associating the appropriate signal sequence with a 
specific SDF can allow sequestering of the protein in specific organelles (plastids, as an 
example), secretion outside of the cell, targeting interaction with particular receptors, etc. 
Hence, the inclusion of signal proteins in constructs involving the SDFs of the invention 
increases the range of manipulation of SDF phenotypic expression. The nucleotide sequence of 
the signal peptide can be isolated from characterized genes using common molecular biological 
techniques or can be synthesized in vitro. 

In addition, the native signal peptide sequences, both amino acid and nucleotide, 
described in the Reference, Sequence, Protein Group or Protein Group Matrix tables can be used to 
modulate polypeptide transport. Further variants of the native signal peptides described in the 
Reference, Sequence, Protein Group or Protein Group Matrix tables are contemplated. Insertions, 
deletions, or substitutions can be made. Such variants will retain at least one of the functions of 
the native signal peptide as well as exhibiting some degree of sequence identity to the native 
sequence. 

Also, fragments of the signal peptides of the invention are useful and can be fused with 
other signal peptides of interest to modulate transport of a polypeptide. 

V. Transformation Techniques 



727 




PATENT 

Attorney Docket No. 2750-1481P 



A wide range of techniques for inserting exogenous polynucleotides are known for a 
number of host cells, including, without limitation, bacterial, yeast, mammalian, insect and plant 
cells. 

Techniques for transforming a wide variety of higher plant species are well known and 
described in the technical and scientific literature. See, e.g. Weising et al., Ann. Rev. Genet 22:421 
(1988); and Christou, Euphytica, v. 85, n. 1-3: 13-27, (1995). 

DNA constructs of the invention may be introduced into the genome of the desired plant 
host by a variety of conventional techniques. For example, the DNA construct may be introduced 
directly into the genomic DN A of the plant cell using techniques such as electroporation and 
microinjection of plant cell protoplasts, or the DNA constructs can be introduced directly to plant 
tissue using ballistic methods, such as DNA particle bombardment. Alternatively, the DNA 
constructs may be combined with suitable T-DNA flanking regions and introduced into a 
conventional Agrobacterium tumefaciens host vector. The virulence functions of the Agrobacterium 
tumefaciens host will direct the insertion of the construct and adjacent marker into the plant cell 
DNA when the cell is infected by the bacteria (McCormac et al., Mol Biotechnol 8:199 (1997); 
Hamilton, Gene 200: 1 07 ( 1 997)); Salomon et al. EMBO ^ 3:141 (1 984); Herrera-Estrella et al. 
EMBOJ. 2:987(1983). 

Microinjection techniques are known in the art and well described in the scientific and 
patent literature. The introduction of DNA constructs using polyethylene glycol precipitation is 
described in Paszkowski et al. EMBOJ. 3:2717 (1984). Electroporation techniques are described in 
Fromm et al. Proa Natl Acad. Set USA 82:5824 (1985). Ballistic transformation techniques are 
described in Klein et al. Nature 327:773 (1987). Agrobacterium tumefaciens-mediated 
transformation techniques, including disarming and use of binary or co-integrate vectors, are well 
described in the scientific literature. See, for example Hamilton, CM, Gene 200:107 (1997); Miiller 
et al. Mol Gen. Genet. 207:171 (1987); Komari et al. Plant! 10:165 (1996); Venkateswarlu et al. 
Biotechnology 9:1103 (1991) and Gleave,/LP. , Plant Mol. Biol 20:1203 (1992); Graves and 
Goldman, Plant Mol Biol 7:34 (1986) and Gould et al., Plant Physiology 95:426 (1991). 

Transformed plant cells which are derived by any of the above transformation techniques 
can be cultured to regenerate a whole plant that possesses the transformed genotype and thus the 
desired phenotype such as seedlessness. Such regeneration techniques rely on manipulation of 
certain phytohormones in a tissue culture growth medium, typically relying on a biocide and/or 



728 




PATENT 

Attorney Docket No. 2750-1481P 
herbicide marker which has been introduced together with the desired nucleotide sequences. Plant 
regeneration from cultured protoplasts is described in Evans et al., Protoplasts Isolation and Culture 
in "Handbook of Plant Cell Culture pp. 124-176, MacMillan Publishing Company, New York, 
1983; and Binding, Regeneration of Plants, Plant Protoplasts, pp. 21-73, CRC Press, Boca Raton, 
5 1988. Regeneration can also be obtained from plant callus, explants, organs, or parts thereof. Such 
regeneration techniques are described generally in Klee et al. Ann. Rev. of Plant Phys. 38:467 
(1987). Regeneration of monocots (rice) is described by Hosoyama et al. {Bioscl Biotechnol 
Biochem. 58:1500 (1994)) and by Ghosh et al. (J. Biotechnol 32:1 (1994)). The nucleic acids of the 
invention can be used to confer desired traits on essentially any plant. 
1 0 Thus, the invention has use over a broad range of plants, including species from the genera 

J Anacardium, Arachis, Asparagus, Atropa, Avena, Brassica, Citrus, Citrullus, Capsicum, 
^ Carthamus, Cocos, Coffea, Cucumis, Cucurbita, Daucus, Elaeis, Fragaria, Glycine, Gossypium, 
HI Helianthus, Heterocallis, Hordeum, Hyoscyamus, Lactuca, Linum, Lolium,Lupinus, Lycopersicon, 
^1 Malus, Manihot, Majorana, Medicago, Nicotiana, Olea, Oryza, Panieum, Pannesetum, Persea, 

Phaseolus, Pistachia, Pisum, Pyrus, Prunus, Raphanus, Ricinus, Secale, Senecio, Sinapis, Solanum, 
ip Sorghum, Theobromus, Trigonella, Triticum, Vicia, Vitis, Vigna, and, Zea. 

ify One of skill will recognize that after the expression cassette is stably incorporated in 

3d 

transgenic plants and confirmed to be operable, it can be introduced into other plants by sexual 
crossing. Any of a number of standard breeding techniques can be used, depending upon the 
20 species to be crossed. 

The particular sequences of SDFs identified are provided in the attached Reference and 
Sequence tables. 
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Allelic variant: An "allelic variant" is an alternative form of the same SDF, which resides 

at the same chromosomal locus in the organism. Allelic variations can occur in any portion of 
the gene sequence, including regulatory regions. Allelic variants can arise by normal genetic 
variation in a population. Allelic variants can also be produced by genetic engineering methods. 
An allelic variant can be one that is found in a naturally occurring plant, including a cultivar or 
ecotype. An allelic variant may or may not give rise to a phenotypic change, and may or may 
not be expressed. An allele can result in a detectable change in the phenotype of the trait 
represented by the locus. A phenotypically silent allele can give rise to a product. 

Alternatively spliced messages: Within the context of the current invention, "alternatively 
spliced messages" refers to mature mRNAs originating from a single gene with variations in the 
number and/or identity of exons, introns and/or intron-exon junctions. 

Chimeric: The term "chimeric" is used to describe genes, as defined supra, or contructs 
wherein at least two of the elements of the gene or construct, such as the promoter and the coding 
sequence and/or other regulatory sequences and/or filler sequences and/or complements thereof, are 
heterologous to each other. 

Constitutive Promoter: Promoters referred to herein as "constitutive promoters" actively 

promote transcription under most, but not necessarily all, environmental conditions and states of 
development or cell differentiation. Examples of constitutive promoters include the cauliflower 
mosaic virus (CaMV) 35S transcript initiation region and the V or T promoter derived from 
T-DNA of Agrobacterium tumefaciens, and other transcription initiation regions from various plant 
genes, such as the maize ubiquitin-1 promoter, known to those of skill. 

Coordinately Expressed: The term "coordinately expressed," as used in the current 
invention, refers to genes that are expressed at the same or a similar time and/or stage and/or 
under the same or similar environmental conditions. 
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Domain: Domains are fingerprints or signatures that can be used to characterize protein 
families and/or parts of proteins. Such fingerprints or signatures can comprise conserved (1) 
primary sequence, (2) secondary structure, and/or (3) three-dimensional conformation. 
Generally, each domain has been associated with either a family of proteins or motifs. Typically, 
these families and/or motifs have been correlated with specific in-vitro and/or in-vivo activities. 
A domain can be any length, including the entirety of the sequence of a protein. Detailed 
descriptions of the domains, associated families and motifs, and correlated activities of the 
polypeptides of the instant invention are described below. Usually, the polypeptides with 
designated domain(s) can exhibit at least one activity that is exhibited by any polypeptide that 
comprises the same domain(s). 

Endogenous: The term "endogenous," within the context of the current invention refers to any 
polynucleotide, polypeptide or protein sequence which is a natural part of a cell or organisms 
regenerated from said cell. 

Exogenous: "Exogenous," as referred to within, is any polynucleotide, polypeptide or protein 
sequence, whether chimeric or not, that is initially or subsequently introduced into the genome 
of an individual host cell or the organism regenerated from said host cell by any means other 
than by a sexual cross. Examples of means by which this can be accomplished are described 
below, and include Agrobacterium-mediated transformation (of dicots - e.g. Salomon et al. 
EMBOJ. 3:141 (1984); Herrera-Estrella et al. EMBO J. 2:987 (1983); of monocots, 
representative papers are those by Escudero et al., Plant J. 10:355 (1996), Ishida et al., Nature 
Biotechnology 14:745 (1996), May et al., Bio/Technology 13:486 (1995)), biolistic methods 
(Armaleo et al., Current Genetics 17:97 1990)), electroporation, in planta techniques, and the 
like. Such a plant containing the exogenous nucleic acid is referred to here as a T 0 for the 
primary transgenic plant and Tj for the first generation. The term "exogenous" as used herein is 
also intended to encompass inserting a naturally found element into a non-naturally found 
location. 

Filler sequence: As used herein, "filler sequence" refers to any nucleotide sequence that is 
inserted into DNA construct to evoke a particular spacing between particular components such as 
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a promoter and a coding region and may provide an additional attribute such as a restriction 
enzyme site. 

Gene: The term "gene/' as used in the context of the current invention, encompasses all 

regulatory and coding sequence contiguously associated with a single hereditary unit with a 
genetic function (see SCHEMATIC 1). Genes can include non-coding sequences that modulate 
the genetic function that include, but are not limited to, those that specify polyadenylation, 
transcriptional regulation, DNA conformation, chromatin conformation, extent and position of 
base methylation and binding sites of proteins that control all of these. Genes comprised of 
"exons" (coding sequences), which may be interrupted by "introns" (non-coding sequences), 
encode proteins. A gene's genetic function may require only RNA expression or protein 
production, or may only require binding of proteins and/or nucleic acids without associated 
expression. In certain cases, genes adjacent to one another may share sequence in such a way 
that one gene will overlap the other. A gene can be found within the genome of an organism, 
artificial chromosome, plasmid, vector, etc., or as a separate isolated entity. 

Gene Family: "Gene family" is used in the current invention to describe a group of 

functionally related genes, each of which encodes a separate protein. 

Heterologous sequences: "Heterologous sequences" are those that are not operatively linked or 
are not contiguous to each other in nature. For example, a promoter from corn is considered 
heterologous to an Arabidopsis coding region sequence. Also, a promoter from a gene encoding a 
growth factor from corn is considered heterologous to a sequence encoding the corn receptor for the 
growth factor. Regulatory element sequences, such as UTRs or 3' end termination sequences that 
do not originate in nature from the same gene as the coding sequence originates from, are 
considered heterologous to said coding sequence. Elements operatively linked in nature and - 
contiguous to each other are not heterologous to each other. On the other hand, these same 
elements remain operatively linked but become heterologous if other filler sequence is placed 
between them. Thus, the promoter and coding sequences of a corn gene expressing an amino 
acid transporter are not heterologous to each other, but the promoter and coding sequence of a 
corn gene operatively linked in a novel manner are heterologous. 
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Homologous gene: In the current invention, "homologous gene" refers to a gene that shares 
sequence similarity with the gene of interest. This similarity may be in only a fragment of the 
sequence and often represents a functional domain such as, examples including without limitation a 
5 DNA binding domain, a domain with tyrosine kinase activity, or the like. The functional activities 
of homologous genes are not necessarily the same. 

Inducible Promoter: An "inducible promoter" in the context of the current invention refers to a 
promoter which is regulated under certain conditions, such as light, chemical concentration, protein 
10 concentration, conditions in an organism, cell, or organelle, etc. A typical example of an inducible 
; : g promoter, which can be utilized with the polynucleotides of the present invention, is PARSK1 , the 
j;*? promoter from the Arabidopsis gene encoding a serine-threonine kinase enzyme, and which 
yi promoter is induced by dehydration, abscissic acid and sodium chloride (Wang and Goodman, Plant 
jftj J, 8:37 (1 995)) Examples of environmental conditions that may affect transcription by inducible 
4 f> promoters include anaerobic conditions, elevated temperature, or the presence of light. 



sequence at a specific location. Mutants of the current invention may or may not have an 
associated identifiable function when the mutant gene is transcribed. 

Orthologous Gene: In the current invention "orthologous gene" refers to a second gene that 
25 encodes a gene product that performs a similar function as the product of a first gene. The 
orthologous gene may also have a degree of sequence similarity to the first gene. The 
orthologous gene may encode a polypeptide that exhibits a degree of sequence similarity to a 
polypeptide corresponding to a first gene. The sequence similarity can be found within a 
functional domain or along the entire length of the coding sequence of the genes and/or their 
30 corresponding polypeptides. 




Intergenic region: "Intergenic region," as used in the current invention, refers to nucleotide 
sequence occurring in the genome that separates adjacent genes. 



20 



Mutant gene: 



In the current invention, "mutant" refers to a heritable change in DNA 
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Percentage of sequence identity: "Percentage of sequence identity," as used herein, is 
determined by comparing two optimally aligned sequences over a comparison window, where the 
fragment of the polynucleotide or amino acid sequence in the comparison window may comprise 
additions or deletions (e.g., gaps or overhangs) as compared to the reference sequence (which does 
5 not comprise additions or deletions) for optimal alignment of the two sequences. The percentage is 
calculated by determining the number of positions at which the identical nucleic acid base or amino 
acid residue occurs in both sequences to yield the number of matched positions, dividing the 
number of matched positions by the total number of positions in the window of comparison and 
multiplying the result by 100 to yield the percentage of sequence identity. Optimal alignment of 
10 sequences for comparison may be conducted by the local homology algorithm of Smith and 
. ^ Waterman Add. APL. Math. 2:482 (1981), by the homology alignment algorithm of Needleman and 
h O Wunsch J. Mol Biol 48:443 (1970), by the search for similarity method of Pearson and Lipman 
Ul Proc, Natl Acad. Set (USA) 85: 2444 (1988), by computerized implementations of these algorithms 
jijj (GAP, BESTFIT, BLAST, PASTA, and TFASTA in the Wisconsin Genetics Software Package, 
N15 Genetics Computer Group (GCG), 575 Science Dr., Madison, WI), or by inspection. Given that 
Q two sequences have been identified for comparison, GAP and BESTFIT are preferably employed to 
determine their optimal alignment. Typically, the default values of 5.00 for gap weight and 0.30 for 
"fr S a P weight length are used. The term "substantial sequence identity" between polynucleotide or 
polypeptide sequences refers to polynucleotide or polypeptide comprising a sequence that has at 
20 least 80% sequence identity, preferably at least 85%, more preferably at least 90% and most 

preferably at least 95%, even more preferably, at least 96%, 97%, 98% or 99% sequence identity 
compared to a reference sequence using the programs. 



Plant Promoter: A "plant promoter" is a promoter capable of initiating transcription in 

25 plant cells and can drive or facilitate transcription of a fragment of the SDF of the instant 

invention or a coding sequence of the SDF of the instant invention. Such promoters need not be 
of plant origin. For example, promoters derived from plant viruses, such as the CaMV35S promoter 
or from Agrobacterium tumefaciens such as the T-DNA promoters, can be plant promoters. A 
typical example of a plant promoter of plant origin is the maize ubiquitin-1 (ubi-l)promoter known 
30 to those of skill. 



734 



PATENT 

Attorney Docket No. 2750-1481P 
Promoter: The term "promoter/ 1 as used herein, refers to a region of sequence determinants 
located upstream from the start of transcription of a gene and which are involved in recognition and 
binding of RNA polymerase and other proteins to initiate and modulate transcription. A basal 
promoter is the minimal sequence necessary for assembly of a transcription complex required for 
transcription initiation. Basal promoters frequently include a "TATA box" element usually located 
between 15 and 35 nucleotides upstream from the site of initiation of transcription. Basal promoters 
also sometimes include a "CCAAT box" element (typically a sequence CCAAT) and/or a GGGCG 
sequence, usually located between 40 and 200 nucleotides, preferably 60 to 120 nucleotides, 
upstream from the start site of transcription. 

^ Public sequence: The term "public sequence ," as used in the context of the instant 

■ tj^ . 

H & application, refers to any sequence that has been deposited in a publicly accessible database. 

Ijl This term encompasses both amino acid and nucleotide sequences. Such sequences are publicly 

{qj accessible, for example, on the BLAST databases on the NCBI FTP web site (accessible at 

■15 ncbi.nlm.gov/blast). The database at the NCBI GTP site utilizes "gi" numbers assigned by NCBI 



10 
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as a unique identifier for each sequence in the databases, thereby providing a non-redundant 
ij^j database for sequence from various databases, including GenBank, EMBL, DBBJ, (DNA 
Database of Japan) and PDB (Brookhaven Protein Data Bank). 



20 Regulatory Sequence: The term "regulatory sequence," as used in the current invention, 

refers to any nucleotide sequence that influences transcription or translation initiation and rate, 
and stability and/or mobility of the transcript or polypeptide product. Regulatory sequences 
include, but are not limited to, promoters, promoter control elements, protein binding sequences, 
5' and 3' UTRs, transcriptional start site, termination sequence, polyadenylation sequence, 

25 introns, certain sequences within a coding sequence, etc. 

Related Sequences: "Related sequences" refer to either a polypeptide or a nucleotide sequence 
that exhibits some degree of sequence similarity with a sequence described by The Reference 
tables and The Sequence tables. 

30 
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Scaffold Attachment Region (SAR): As used herein, "scaffold attachment region" is a 

DNA sequence that anchors chromatin to the nuclear matrix or scaffold to generate loop domains 
that can have either a transcriptionally active or inactive structure (Spiker and Thompson (1996) 
Plant Physiol. 110: 15-21). 

Sequence-determined DNA fragments (SDFs): "Sequence-determined DNA fragments" as 
used in the current invention are isolated sequences of genes, fragments of genes, intergenic 
regions or contiguous DNA from plant genomic DNA or cDNA or RNA the sequence of which 
has been determined. 

Signal Peptide: A "signal peptide" as used in the current invention is an amino acid 

sequence that targets the protein for secretion, for transport to an intracellular compartment or 
U1 organelle or for incorporation into a membrane. Signal peptides are indicated in the tables and a 

m 

r|i more detailed description located below. 

% 
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Specific Promoter: In the context of the current invention, "specific promoters" refers to a 
subset of inducible promoters that have a high preference for being induced in a specific tissue or 
cell and/or at a specific time during development of an organism. By "high preference" is meant 
* at least 3 -fold, preferably 5-fold, more preferably at least 10-fold still more preferably at least 
20 20-fold, 50-fold or 100-fold increase in transcription in the desired tissue over the transcription 
in any other tissue. Typical examples of temporal and/or tissue specific promoters of plant origin 
that can be used with the polynucleotides of the present invention, are: PTA29, a promoter which is 
capable of driving gene transcription specifically in tapetum and only during anther development 
(Koltonow et al., Plant Cell 2:1201 (1990); RCc2 and RCc3, promoters that direct root-specific 
25 gene transcription in rice (Xu et al., Plant Mol Biol 27:237 (1995); TobRB27, a root-specific 
promoter from tobacco (Yamamoto et al., Plant Cell 3:371 (1991)). Examples of tissue-specific 
promoters under developmental control include promoters that initiate transcription only in certain 
tissues or organs, such as root, ovule, fruit, seeds, or flowers. Other suitable promoters include those 
from genes encoding storage proteins or the lipid body membrane protein, oleosin. A few root- 
30 specific promoters are noted above. 
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Stringency: "Stringency" as used herein is a function of probe length, probe composition (G + 
C content), and salt concentration, organic solvent concentration, and temperature of 
hybridization or wash conditions. Stringency is typically compared by the parameter T m , which 
is the temperature at which 50% of the complementary molecules in the hybridization are 
hybridized, in terms of a temperature differential from T m . High stringency conditions are those 
providing a condition of T m - 5°C to T m - 10°C. Medium or moderate stringency conditions are 
those providing T m - 20°C to T m - 29°C. Low stringency conditions are those providing a 
condition of T m - 40°C to T m - 48°C. The relationship of hybridization conditions to T m (in °C) is 
expressed in the mathematical equation 

T m = 81.5 -16.6(logi 0 [Na + ]) + 0.41(%G+C) - (600/N) (1) 



^ where N is the length of the probe. This equation works well for probes 14 to 70 nucleotides in 
jiy length that are identical to the target sequence. The equation below for T m of DNA-DNA hybrids is 
4*5 useful for probes in the range of 50 to greater than 500 nucleotides, and for conditions that include 
an organic solvent (formamide). 



T m - 81.5+16.6 log {[Na + ]/(l-f0.7[Na + ])}+ 0.41(%G+C)-500/L 0.63(%formamide) (2) 



20 where L is the length of the probe in the hybrid. (P. Tijessen, "Hybridization with Nucleic Acid 
Probes" in Laboratory Techniques in Biochemistry and Molecular Biology, P.C. vand der Vliet, 
ed., c. 1993 by Elsevier, Amsterdam.) The T m of equation (2) is affected by the nature of the 
hybrid; for DNA-RNA hybrids T m is 10-1 5°C higher than calculated, for RNA-RNA hybrids T m 
is 20-25°C higher. Because the T m decreases about 1 °C for each 1% decrease in homology when 

25 a long probe is used (Bonner et al., J. Mol Biol. 81 : 123 (1973)), stringency conditions can be 
adjusted to favor detection of identical genes or related family members. 

Equation (2) is derived assuming equilibrium and therefore, hybridizations according to 
the present invention are most preferably performed under conditions of probe excess and for 
sufficient time to achieve equilibrium. The time required to reach equilibrium can be shortened 

30 by inclusion of a hybridization accelerator such as dextran sulfate or another high volume 
polymer in the hybridization buffer. 
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Stringency can be controlled during the hybridization reaction or after hybridization has 
occurred by altering the salt and temperature conditions of the wash solutions used. The 
formulas shown above are equally valid when used to compute the stringency of a wash solution. 
Preferred wash solution stringencies lie within the ranges stated above; high stringency is 5-8°C 
below T m? medium or moderate stringency is 26-29°C below T m and low stringency is 45-48°C 
below T m . 

Substantially free of: A composition containing A is "substantially free of" B when at 

least 85% by weight of the total A+B in the composition is A. Preferably, A comprises at least 
about 90% by weight of the total of A+B in the composition, more preferably at least about 95% 
or even 99% by weight. For example, a plant gene or DNA sequence can be considered 
substantially free of other plant genes or DNA sequences. 

Translational start site: In the context of the current invention, a "translational start site" is 

usually an ATG in the cDNA transcript, more usually the first ATG. A single cDNA, however, 
may have multiple translational start sites. 

Transcription start site: "Transcription start site" is used in the current invention to describe 
the point at which transcription is initiated. This point is typically located about 25 nucleotides 
downstream from a TFIID binding site, such as a TATA box. Transcription can initiate at one or 
more sites within the gene, and a single gene may have multiple transcriptional start sites, some of 
which may be specific for transcription in a particular cell-type or tissue. 

Untranslated region (UTR): A "UTR" is any contiguous series of nucleotide bases that 

is transcribed, but is not translated. These untranslated regions may be associated with particular 
functions such as increasing mRNA message stability. Examples of UTRs include, but are not 
limited to polyadenylation signals, terminations sequences, sequences located between the 
transcriptional start site and the first exon (5' UTR) and sequences located between the last exon 
and the end of the mRNA (3' UTR). 
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Variant: The term "variant" is used herein to denote a polypeptide or protein or 
polynucleotide molecule that differs from others of its kind in some way. For example, 
polypeptide and protein variants can consist of changes in amino acid sequence and/or charge 
and/or post-translational modifications (such as glycosylation, etc). 
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EXAMPLES 

The invention is illustrated by way of the following examples. The invention is not 
limited by these examples as the scope of the invention is defined solely by the claims following. 

EXAMPLE 1 : cDNA PREPARATION 

A number of the nucleotide sequences disclosed in the Reference and Sequence tables or 
polynucleotides encoding polypeptides of the Protein Group or Protein Group Matrix tables, herein 
as representative of the SDFs of the invention can be obtained by sequencing genomic DNA 
(gDNA) and/or cDNA from corn plants grown from HYBRID SEED # 35A19, purchased from 
Pioneer Hi-Bred International, Inc., Supply Management, P.O. Box 256, Johnston, Iowa 50131- 
0256. 

A number of the nucleotide sequences disclosed in the Reference and Sequence tables or 
polynucleotides encoding polypeptides of the Protein Group or Protein Group Matrix tables, herein 
as representative of the SDFs of the invention can also be obtained by sequencing genomic DNA 
from Arabidopsis thaliana, Wassilewskija ecotype or by sequencing cDNA obtained from 
mRNA from such plants as described below. This is a true breeding strain. Seeds of the plant 
are available from the Arabidopsis Biological Resource Center at the Ohio State University, 
under the accession number CS2360. Seeds of this plant were deposited under the terms and 
conditions of the Budapest Treaty at the American Type Culture Collection, Manassas, VA on 
August 31, 1999, and were assigned ATCC No. PTA-595. 

Other methods for cloning full-length cDNA are described, for example, by Seki et al., 
Plant Journal 15:707-720 (1998) "High-efficiency cloning of Arabidopsis frill-length cDNA by 
biotinylated Cap trapper"; Maruyama et al., Gene 138 :171 (1994) "Oligo-capping a simple 
method to replace the cap structure of eukaryotic mRNAs with oligoribonucleotides"; and WO 
96/34981. 

Tissues were, or each organ was, individually pulverized and frozen in liquid nitrogen. 
Next, the samples were homogenized in the presence of detergents and then centrifuged. The 
debris and nuclei were removed from the sample and more detergents were added to the sample. 
The sample was centrifuged and the debris was removed. Then the sample was applied to a 2M 
sucrose cushion to isolate polysomes. The RNA was isolated by treatment with detergents and 
proteinase K followed by ethanol precipitation and centrifugation. The polysomal RNA from the 
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different tissues was pooled according to the following mass ratios: 15/15/1 for male 
inflorescences, female inflorescences and root, respectively. The pooled material was then used 
for cDNA synthesis by the methods described below. 

Starting material for cDNA synthesis for the exemplary corn cDNA clones with 
sequences presented in the Reference and Sequence tables or polynucleotides encoding 
polypeptides of the Protein Group or Protein Group Matrix tables was poly(A)-containing 
polysomal mRNAs from inflorescences and root tissues of corn plants grown from HYBRID 
SEED # 35 A 19. Male inflorescences and female (pre-and post-fertilization) inflorescences were 
isolated at various stages of development. Selection for poly(A) containing polysomal RNA was 
done using oligo d(T) cellulose columns, as described by Cox and Goldberg, "Plant Molecular 
Biology: A Practical Approach", pp. 1-35, Shaw ed., c. 1988 by IRL, Oxford. The quality and 
the integrity of the polyA+ RNAs were evaluated. 

Starting material for cDNA synthesis for the exemplary Arabidopsis cDNA clones with 
sequences presented in the Reference and Sequence tables or polynucleotides encoding 
polypeptides of the Protein Group or Protein Group Matrix tables was polysomal RNA isolated 
from the top-most inflorescence tissues of Arabidopsis thaliana Wassilewskija (Ws.) and from 
roots of Arabidopsis thaliana Landsberg erecta (L. er.), also obtained from the Arabidopsis 
Biological Resource Center. Nine parts inflorescence to every part root was used, as measured 
by wet mass. Tissue was pulverized and exposed to liquid nitrogen. Next, the sample was 
homogenized in the presence of detergents and then centrifuged. The debris and nuclei were 
removed from the sample and more detergents were added to the sample. The sample was 
centrifuged and the debris was removed and the sample was applied to a 2M sucrose cushion to 
isolate polysomal RNA. Cox et aL, "Plant Molecular Biology: A Practical Approach", pp. 1-35, 
Shaw ed., c. 1988 by IRL, Oxford. The polysomal RNA was used for cDNA synthesis by the 
methods described below. Polysomal mRNA was then isolated as described above for corn 
cDNA. The quality of the RNA was assessed electrophoretically. 

Following preparation of the mRNAs from various tissues as described above, selection of 
mRNA with intact 5' ends and specific attachment of an oligonucleotide tag to the 5 5 end of such 
mRNA was performed using either a chemical or enzymatic approach. Both techniques take 
advantage of the presence of the "cap" structure, which characterizes the 5' end of most intact 
mRNAs and which comprises a guanosine generally methylated once, at the 7 position. 

741 



PATENT 

Attorney Docket No. 2750-1481P 
The chemical modification approach involves the optional elimination of the 2\ 3'-cis diol 
of the 3' terminal ribose, the oxidation of the 2\ 3'-cis diol of the ribose linked to the cap of the 5' 
ends of the mRNAs into a dialdehyde, and the coupling of the such obtained dialdehyde to a 
derivatized oligonucleotide tag. Further detail regarding the chemical approaches for obtaining 
mRNAs having intact 5 5 ends are disclosed in International Application No. W096/34981 
published November 7, 1996. 

The enzymatic approach for ligating the oligonucleotide tag to the intact 5' ends of mRNAs 
involves the removal of the phosphate groups present on the 5' ends of uncapped incomplete 
mRNAs, the subsequent decapping of mRNAs having intact 5' ends and the ligation of the 
phosphate present at the 5' end of the decapped mRNA to an oligonucleotide tag. Further detail 
regarding the enzymatic approaches for obtaining mRNAs having intact 5' ends are disclosed in 
Dumas Milne Edwards J.B. (Doctoral Thesis of Paris VI University, Le clonage des ADNc 
complets: difficultes et perspectives nouvelles. Apports pour Tetude de la regulation de l'expression 
de la tryptophane hydroxylase de rat, 20 Dec. 1993), EPO 625572 and Kato et al, Gene 150:243- 
250(1994). 

In both the chemical and the enzymatic approach, the oligonucleotide tag has a restriction 
enzyme site (e.g. an EcoRI site) therein to facilitate later cloning procedures. Following attachment 
of the oligonucleotide tag to the mRNA, the integrity of the mRNA is examined by performing a 
Northern blot using a probe complementary to the oligonucleotide tag. 

For the mRNAs joined to oligonucleotide tags using either the chemical or the enzymatic 
method, first strand cDNA synthesis is performed using an oligo-dT primer with reverse 
transcriptase. This oligo-dT primer can contain an internal tag of at least 4 nucleotides, which can 
be different from one mRNA preparation to another. Methylated dCTP is used for cDNA first 
strand synthesis to protect the internal EcoRI sites from digestion during subsequent steps. The first 
strand cDNA is precipitated using isopropanol after removal of RNA by alkaline hydrolysis to 
eliminate residual primers. 

Second strand cDNA synthesis is conducted using a DNA polymerase, such as Klenow 
fragment and a primer corresponding to the 5' end of the ligated oligonucleotide. The primer is 
typically 20-25 bases in length. Methylated dCTP is used for second strand synthesis in order to 
protect internal EcoRI sites in the cDNA from digestion during the cloning process. 
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Following second strand synthesis, the full-length cDNAs are cloned into a phagemid 
vector, such as pBlueScript™ (Stratagene). The ends of the full-length cDNAs are blunted with T4 
DNA polymerase (Biolabs) and the cDNA is digested with EcoRI. Since methylated dCTP is used 
during cDNA synthesis, the EcoRI site present in the tag is the only hemi-methylated site; hence the 
only site susceptible to EcoRI digestion. In some instances, to facilitate subcloning, an Hind III 
adapter is added to the 3' end of full-length cDNAs. 

The full-length cDNAs are then size fractionated using either exclusion chromatography 
(AcA, Biosepra) or electrophoretic separation which yields 3 to 6 different fractions. The full- 
length cDNAs are then directionally cloned either into pBlueScript™ using either the EcoRI and 
Smal restriction sites or, when the Hind III adapter is present in the full-length cDNAs, the EcoRI 
and Hind III restriction sites. The ligation mixture is transformed, preferably by electroporation, into 
bacteria, which are then propagated under appropriate antibiotic selection. 

Clones containing the oligonucleotide tag attached to full-length cDNAs are selected as 
follows. 

The plasmid cDNA libraries made as described above are purified (e.g. by a column 
available from Qiagen). A positive selection of the tagged clones is performed as follows. Briefly, 
in this selection procedure, the plasmid DNA is converted to single stranded DNA using phage Fl 
gene II endonuclease in combination with an exonuclease (Chang et al, Gene 127:95 (1993)) such 
as exonuclease III or T7 gene 6 exonuclease. The resulting single stranded DNA is then purified 
using paramagnetic beads as described by Fry et al. , Biotechniques 13: 124 (1992). Here the single 
stranded DNA is hybridized with a biotinylated oligonucleotide having a sequence corresponding to 
the 3' end of the oligonucleotide tag. Preferably, the primer has a length of 20-25 bases. Clones 
including a sequence complementary to the biotinylated oligonucleotide are selected by incubation 
with streptavidin coated magnetic beads followed by magnetic capture. After capture of the positive 
clones, the plasmid DNA is released from the magnetic beads and converted into double stranded 
DNA using a DNA polymerase such as Thermo Sequenase™ (obtained from Amersham Pharmacia 
Biotech). Alternatively, protocols such as the Gene Trapper™ kit (Gibco BRL) can be used. The 
double stranded DNA is then transformed, preferably by electroporation, into bacteria. The 
percentage of positive clones having the 5' tag oligonucleotide is typically estimated to be between 
90 and 98% from dot blot analysis. 
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Following transformation, the libraries are ordered in microtiter plates and sequenced. 
The Arabidopsis library was deposited at the American Type Culture Collection on January 7, 
2000 as "E-coli liba 010600" under the accession numbe r PTA-1161 . 

G. EXAMPLE 2: Southern hybridizations 

The SDFs of the invention can be used in Southern hybridizations as described above. 
The following describes extraction of DNA from nuclei of plant cells, digestion of the nuclear 
DNA and separation by length, transfer of the separated fragments to membranes, preparation of 
probes for hybridization, hybridization and detection of the hybridized probe. 

The procedures described herein can be used to isolate related polynucleotides or for 
diagnostic purposes. Moderate stringency hybridization conditions, as defined above, are 
described in the present example. These conditions result in detection of hybridization between 
sequences having at least 70% sequence identity. As described above, the hybridization and 
wash conditions can be changed to reflect the desired percenatge of sequence identity between 
probe and target sequences that can be detected. 

In the following procedure, a probe for hybridization is produced from two PCR reactions 
using two primers from genomic sequence of Arabidopsis thaliana. As described above, the 
particular template for generating the probe can be any desired template. 

The first PCR product is assessed to validate the size of the primer to assure it is of the 
expected size. Then the product of the first PCR is used as a template, with the same pair of 
primers used in the first PCR, in a second PCR that produces a labeled product used as the probe. 

Fragments detected by hybridization, or other bands of interest, can be isolated from gels 
used to separate genomic DNA fragments by known methods for further purification and/or 
characterization. 



Buffers for nuclear DNA extraction 
1. 10XHB 





1000 ml 




40 mM spermidine 


10.2 g 


Spermine (Sigma S-2876) and spermidine (Sigma 
S-2501) 
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10 mM spermine 


3.5 g 


Stabilize chromatin and the nuclear membrane 


0.1 MEDTA 
(disodium) 


37.2 g 


EDTA inhibits nuclease 


0.1 MTris 


12.1 g 


Buffer 


0.8 M KC1 


59.6 g 


Adjusts ionic strength for stability of nuclei 



Adjust pH to 9.5 with ION NaOH. It appears that there is a nuclease present in leaves. 
Use of pH 9.5 appears to inactivate this nuclease. 

2. 2 M sucrose (684 g per 1000 ml) 

Heat about half the final volume of water to about 50 C. Add the sucrose slowly then 
bring the mixture to close to final volume; stir constantly until it has dissolved. Bring the 
solution to volume. 

3. Sarkosyl solution (lyses nuclear membranes) 

1000 ml 

N-lauroyl sarcosine (Sarkosyl) 20.0 g 

0.1 MTris 12.1 g 

0.04 M EDTA (Disodium) 14.9 g 

Adjust the pH to 9.5 after all the components are dissolved and bring up to the proper 
volume. 

4. 20% Triton X- 100 
80 ml Triton X-100 

320 ml lxHB (w/o p-ME and PMSF) 
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Prepare in advance; Triton takes some time to dissolve 

Procedure 

Prepare IX "H" buffer (keep ice-cold during use) 

1000 ml 

10XHB 100 ml 

2 M sucrose 250 ml a non-ionic osmoticum 

Water 634 ml 

Added just before use: 

100 mM PMSF* 10 ml a protease inhibitor; protects nuclear 

membrane proteins 

B-mercaptoethanol 1 ml inactivates nuclease by reducing 

disulfide bonds 

*100 mM PMSF 

(phenyl methyl sulfonyl fluoride, Sigma P-7626) 
(add 0.0875 g to 5 ml 100% ethanol) 

Homogenize the tissue in a blender (use 300-400 ml of lxHB per blender). Be sure that 
you use 5-10 ml of HB buffer per gram of tissue. Blenders generate heat so be sure to 
keep the homogenate cold. It is necessary to put the blenders in ice periodically. 

Add the 20% Triton X-100 (25 ml per liter of homogenate) and gently stir on ice for 20 
min. This lyses plastid, but not nuclear, membranes. 

Filter the tissue suspension through several nylon filters into an ice-cold beaker. The first 
filtration is through a 250-micron membrane; the second is through an 85-micron 
membrane; the third is through a 50-micron membrane; and the fourth is through a 20- 
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micron membrane. Use a large funnel to hold the filters. Filtration can be sped up by 
gently squeezing the liquid through the filters. 

Centrifuge the filtrate at 1 200 x g for 20 min. at 4°C to pellet the nuclei. 

Discard the dark green supernatant. The pellet will have several layers to it. One is 
starch; it is white and gritty. The nuclei are gray and soft. In the early steps, there may 
be a dark green and somewhat viscous layer of chloroplasts. 

Wash the pellets in about 25 ml cold H buffer (with Triton X-100) and resuspend by 
swirling gently and pipetting. After the pellets are resuspended. 

Pellet the nuclei again at 1200 - 1300 x g. Discard the supernatant. 

Repeat the wash 3-4 times until the supernatant has changed from a dark green to a pale 
green. This usually happens after 3 or 4 resuspensions. At this point, the pellet is 
typically grayish white and very slippery. The Triton X-100 in these repeated steps helps 
to destroy the chloroplasts and mitochondria that contaminate the prep. 

Resuspend the nuclei for a final time in a total of 1 5 ml of H buffer and transfer the 
suspension to a sterile 125 ml Erlenmeyer flask. 

Add 15 ml, dropwise, cold 2% Sarkosyl, 0.1 M Tris, 0.04 M EDTA solution (pH 9.5) 
while swirling gently. This lyses the nuclei. The solution will become very viscous. 

Add 30 grams of CsCl and gently swirl at room temperature until the CsCl is in solution. 
The mixture will be gray, white and viscous. 

Centrifuge the solution at 1 1,400 x g at 4°C for at least 30 min. The longer this spin is, 
the firmer the protein pellicle. 
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The result is typically a clear green supernatant over a white pellet, and (perhaps) under a 
protein pellicle. Carefully remove the solution under the protein pellicle and above the 
pellet. Determine the density of the solution by weighing 1 ml of solution and add CsCl 
if necessary to bring to 1 .57 g/ml. The solution contains dissolved solids (sucrose etc) 
and the refractive index alone will not be an accurate guide to CsCl concentration. 

Add 20 (Ltl of 10 mg/ml EtBr per ml of solution. 

Centrifuge at 184,000 x g for 16 to 20 hours in a fixed-angle rotor. 

Remove the dark red supernatant that is at the top of the tube with a plastic transfer 
pipette and discard. Carefully remove the DNA band with another transfer pipette. The 
DNA band is usually visible in room light; otherwise, use a long wave UV light to locate 
the band. 

Extract the ethidium bromide with isopropanol saturated with water and salt. Once the 
solution is clear, extract at least two more times to ensure that all of the EtBr is gone. Be 
very gentle, as it is very easy to shear the DNA at this step. This extraction may take a 
while because the DNA solution tends to be very viscous. If the solution is too viscous, 
dilute it with TE. 

Dialyze the DNA for at least two days against several changes (at least three times) of TE 
(10 mM Tris, ImM EDTA, pH 8) to remove the cesium chloride. 

Remove the dialyzed DNA from the tubing. If the dialyzed DNA solution contains a lot 
of debris, centrifuge the DNA solution at least at 2500 x g for 10 min. and carefully 
transfer the clear supernatant to a new tube. Read the A260 concentration of the DNA. 

Assess the quality of the DNA by agarose gel electrophoresis (1% agarose gel) of the 
DNA. Load 50 ng and 100 ng (based on the OD reading) and compare it with known and 
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good quality DNA. Undigested lambda DNA and a lambda-Hindlll-digested DNA are 
good molecular weight makers. 

Protocol for Digestion of Genomic DNA 
Protocol : 

1 . The relative amounts of DNA for different crop plants that provide approximately a 
balanced number of genome equivalent is given in Table 3. Note that due to the size of 
the wheat genome, wheat DNA will be underrepresented. Lambda DNA provides a 
useful control for complete digestion. 

2. Precipitate the DNA by adding 3 volumes of 100% ethanol. Incubate at -20°C for at least 
two hours. Yeast DNA can be purchased and made up at the necessary concentration, 
therefore no precipitation is necessary for yeast DNA. 

m 
m 

W 3. Centrifuge the solution at 1 1,400 x g for 20 min. Decant the ethanol carefully (be careful 
Q not to disturb the pellet). Be sure that the residual ethanol is completely removed either 

^1 by vacuum desiccation or by carefully wiping the sides of the tubes with a clean tissue. 

y! 4. Resuspend the pellet in an appropriate volume of water. Be sure the pellet is fully 
1 5 resuspended before proceeding to the next step. This may take about 30 min. 

5. Add the appropriate volume of 10X reaction buffer provided by the manufacturer of the 
restriction enzyme to the resuspended DNA followed by the appropriate volume of 
enzymes. Be sure to mix it properly by slowly swirling the tubes. 

6. Set-up the lambda digestion-control for each DNA that you are digesting. 

20 7. Incubate both the experimental and lambda digests overnight at 37°C. Spin down 
condensation in a microfuge before proceeding. 
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8. After digestion, add 2 jal of loading dye (typically 0.25% bromophenol blue, 0.25% 
xylene cyanol in 15% Ficoll or 30% glycerol) to the lambda-control digests and load in 
1% TPE-agarose gel (TPE is 90 mM Tris-phosphate, 2 mM EDTA, pH 8). If the lambda 
DNA in the lambda control digests are completely digested, proceed with the 

5 precipitation of the genomic DNA in the digests. 

9. Precipitate the digested DNA by adding 3 volumes of 100% ethanol and incubating in - 
20°C for at least 2 hours (preferably overnight). 

EXCEPTION: Arabidopsis and yeast DNA are digested in an appropriate volume; they 
. J? don't have to be precipitated. 

Resuspend the DNA in an appropriate volume of TE (e.g., 22 (il x 50 blots = 1 100 jal) 
and an appropriate volume of 10X loading dye (e.g., 2.4 jal x 50 blots = 120 jal). Be 
careful in pipetting the loading dye - it is viscous. Be sure you are pipetting the correct 
volume. 

I 



IB 10. 

m 
m 



Table 3 



Some guide points in digesting genomic DNA. 



Species 


Genome Size 


Size Relative to 
Arabidopsis 


Genome 

Equivalent to 2 \ig 
Arabidopsis DNA 


Amount 
of DNA 
per blot 


Arabidopsis 


120 Mb 


IX 


IX 


2 ug 


Brassica 


1,100 Mb 


9.2X 


0.54X 


10 ug 


Corn 


2,800 Mb 


23.3X 


0.43X 


20 ng 



750 



PATENT 

Attorney Docket No. 2750-1481P 



Cotton 


2,300 Mb 


19.2X 


0.52X 


20 ng 


Oat 


11,300 Mb 


94X 


0.1 IX 


20 jig 


Rice 


400 Mb 


3.3X 


0.75X 


5 l^g 


Soybean 


1,100 Mb 


9.2X 


0.54X 


10 jig 


Sugarbeet 


758 Mb 


6.3X 


0.8X 


10 Hg 


Sweetclover 


1,100 Mb 


9.2X 


0.54X 


10 ^ 


Wheat 


16,000 Mb 


133X 


0.08X 


20 jxg 


Yeast 


15 Mb 


0.1 2X 


IX 


0.25 |ag 



Protocol for Southern Blot Analysis 

The digested DNA samples are electrophoresed in 1% agarose gels in lx TPE buffer. Low 
voltage; overnight separations are preferred. The gels are stained with EtBr and photographed. 

1 . For blotting the gels, first incubate the gel in 0.25 N HC1 (with gentle shaking) for about 
1 5 min. 

2. Then briefly rinse with water. The DNA is denatured by 2 incubations. Incubate (with 
shaking) in 0.5 M NaOH in 1.5 M NaCl for 15 min. 

3. The gel is then briefly rinsed in water and neutralized by incubating twice (with shaking) 
in 1.5 M Tris pH 7.5 in 1.5 M NaCl for 15 min. 

4. A nylon membrane is prepared by soaking it in water for at least 5 min, then in 6X SSC 
for at least 15 min. before use. (20x SSC is 175.3 g NaCl, 88.2 g sodium citrate per liter, 
adjusted to pH 7.0.) 

5. The nylon membrane is placed on top of the gel and all bubbles in between are removed. 
The DNA is blotted from the gel to the membrane using an absorbent medium, such as 
paper toweling and 6x SCC buffer. After the transfer, the membrane may be lightly 
brushed with a gloved hand to remove any agarose sticking to the surface. 
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6. The DNA is then fixed to the membrane by UV crosslinking and baking at 80°C. The 
membrane is stored at 4°C until use. 

B. Protocol for PCR Amplification of Genomic Fragments in Arabidopsis 



Amplification procedures : 
5 1 . Mix the following in a 0.20 ml PCR tube or 96-well PCR plate: 





Volume 


Stock 


Final Amount or Cone. 


;Fi 

Id 


0.5 fil 


- 10 ng/jal genomic DNA 1 


5 ng 


i! H 

m 

i £« 


2.5 nl 


10X PCR buffer 


20 mM Tris, 50 mM KC1 


; J 5 

■;a> ; 

M 


0.75 (il 


50 mM MgCl 2 


1.5 mM 


ru 


1 Ml 


1 0 pmol/jjl Primer 1 (Forward) 


1 0 pmol 




1 Ml 


10 pmol/|u,l Primer 2 (Reverse) 


10 pmol 




0.5 jlxI 


5 mM dNTPs 


0.1 mM 




0.1 nl 


5 units/|il Platinum Taq™ (Life 
Technologies, Gaithersburg, MD) 
DNA Polymerase 


1 units 




(to 25 jil) 


Water 





1 Arabidopsis DNA is used in the present experiment, but the procedure is a general one. 
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2. The template DNA is amplified using a Perkin Elmer 9700 PCR machine: 

1) 94 C for 10 min. followed by 



2) 




3) 




4) 


5 cycles: 




5 cycles: 




25 cycles: 


94 °C- 


30 sec 


94 °C- 


30 sec 


94 °C - 30 sec 


62 °C- 


30 sec 


58 °C- 


30 sec 


53 °C- 30 sec 


72 °C- 


3 min 


72 °C- 


3 min 


72 °C - 3 min 



5) 72 C for 7 min. Then the reactions are stopped by chilling to 4°C. 
The procedure can be adapted to a multi-well format if necessary. 
Quantification and Dilution of PCR Products: 

1 . The product of the PCR is analyzed by electrophoresis in a 1% agarose gel. A linearized 
plasmid DNA can be used as a quantification standard (usually at 50, 100 3 200, and 400 
ng). These will be used as references to approximate the amount of PCR products. 
Hindlll-digested Lambda DNA is useful as a molecular weight marker. The gel can be 
run fairly quickly; e.g., at 100 volts. The standard gel is examined to determine that the 
size of the PCR products is consistent with the expected size and if there are significant 
extra bands or smeary products in the PCR reactions. 

2. The amounts of PCR products can be estimated on the basis of the plasmid standard. 

3. For the small number of reactions that produce extraneous bands, a small amount of 
DNA from bands with the correct size can be isolated by dipping a sterile 1 0-jjI tip into 
the band while viewing though a UV Transilluminator. The small amount of agarose gel 
(with the DNA fragment) is used in the labeling reaction. 
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C. Protocol for PCR-DIG-Labeling of DNA 
Solutions : 

Reagents in PCR reactions (diluted PCR products, 10X PCR Buffer, 50 mM MgCl 2 , 5 
U/jal Platinum Taq Polymerase, and the primers) 

10X dNTP + DIG-1 1-dUTP [1 :5]: (2 mM dATP, 2 mM dCTP, 2 mM dGTP, 1 .65 mM 
dTTP, 0.35 mM DIG-1 1-dUTP) 

10X dNTP + DIG-ll-dUTP [1:10]: (2 mM dATP, 2 mM dCTP, 2 mM dGTP, 1.81 mM 
dTTP, 0.19 mM DIG-1 1-dUTP) 

10X dNTP + DIG-ll-dUTP [1:15]: (2 mM dATP, 2 mM dCTP, 2 mM dGTP, 1.875 mM 
dTTP, 0.125 mM DIG-1 1-dUTP) 

TE buffer (10 mM Tris, 1 mM EDTA, pH 8) 

Maleate buffer: In 700 ml of deionized distilled water, dissolve 1 1 .61 g maleic acid and 
8.77 g NaCl. Add NaOH to adjust the pH to 7.5. Bring the volume to 1 L. Stir for 15 
min. and sterilize. 

10% blocking solution: In 80 ml deionized distilled water, dissolve 1.1 6g maleic acid. 
Next, add NaOH to adjust the pH to 7.5. Add 10 g of the blocking reagent powder 
(Boehringer Mannheim, Indianapolis, IN, Cat. no. 1096176). Heat to 60°C while stirring 
to dissolve the powder. Adjust the volume to 100 ml with water. Stir and sterilize. 

1% blocking solution: Dilute the 10% stock to 1% using the maleate buffer. 

Buffer 3 (100 mM Tris, 100 mM NaCl, 50 mM MgCl 2 , pH9.5). Prepared from 
autoclaved solutions of 1M Tris pH 9.5, 5 M NaCl, and 1 M MgCl 2 in autoclaved 
distilled water. 
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Procedure: 



1 . PCR reactions are performed in 25 jj.1 volumes containing: 



PCR buffer 
MgCl 2 

10XdNTP + DIG-ll-dUTP 
Platinum Taq™ Polymerase 
1 0 pg probe DNA 
1 0 pmol primer 1 



IX 

1.5 mM 

IX (please see the note below) 
1 unit 



Note: 



IS 

m 
m 

fit! 

m 

SU 



IPX dNTP + DIG-1 1-dUTP (1:5) 



10X dNTP + DIG-1 1-dUTP (1:10) 
1 OX dNTP + DIG- 1 1 -dUTP (1:15) 



Use for : 
< 1 kb 



1 kbto 1.8 kb 
> 1.8 kb 



2. The PCR reaction uses the following amplification cycles: 
1) 94°Cfor lOmin. 



2) 




3) 




4) 


5 cycles: 




5 cycles: 




25 cycles: 


95°C - 


30 sec 


95°C - 


30 sec 


95°C -30 sec 


61°C - 


1 min 


59°C - 


1 min 


51°C - 1 min 


73°C - 


5 min 


75°C - 


5 min 


73°C - 5 min 



15 



5) 72 C for 8 min. The reactions are terminated by chilling to 4 C (hold). 



3. The products are analyzed by electrophoresis- in a 1% agarose gel, comparing to an 
aliquot of the unlabelled probe starting material. 
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The amount of DIG-labeled probe is determined as follows: 

Make serial dilutions of the diluted control DNA in dilution buffer (TE: 10 mM Tris and 
1 mM EDTA, pH 8) as shown in the following table: 



DIG-labeled control 
DNA starting cone. 


Stepwise Dilution 


Final Cone. (Dilution 
Name) 


5 ng/ul 


1 ul in 49 ul TE 


100 pg/ul (A) 


100 pg/ul(A) 


25 ul in 25 ul TE 


50 pg/^il (B) 


50 pg/ul (B) 


25 ul in 25 ul TE 


25 pg/ul (C) 


25 pg/ul (C) 


20 ul in 30 ul TE 


10 pg/ul (D) 



a. Serial deletions of a DIG-labeled standard DNA ranging from 1 00 pg to 10 pg are 
spotted onto a positively charged nylon membrane, marking the membrane lightly 
with a pencil to identify each dilution. 

b. Serial dilutions (e.g., 1 :50, 1 :2500, 1 : 1 0,000) of the newly labeled DNA probe are 
spotted. 

c. The membrane is fixed by UV crosslinking. 

d. The membrane is wetted with a small amount of maleate buffer and then 
incubated in 1% blocking solution for 15 min at room temp. 

e. The labeled DNA is then detected using alkaline phosphatase conjugated anti- 
DIG antibody (Boehringer Mannheim, Indianapolis, IN, cat. no. 1093274) and an 
NBT substrate according to the manufacture's instruction. 
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f. Spot intensities of the control and experimental dilutions are then compared to 
estimate the concentration of the PCR-DIG-labeled probe. 

D. Prehybridization and Hybridization of Southern Blots 
Solutions: 



100% Formamide purchased from Gibco 

20X SSC (IX = 0.15 M NaCl, 0.015 M Na 3 citrate) 

perL: 175gNaCl 

87.5 g Na 3 citrate-2H 2 0 

20% Sarkosyl (N-lauroyl-sarcosine) 

20% SDS (sodium dodecyl sulphate) 

10% Blocking Reagent: In 80 ml deionized distilled water, dissolve 1.16 g maleic acid. 
Next, add NaOH to adjust the pH to 7.5. Add 10 g of the blocking reagent powder. 
Heat to 60°C while stirring to dissolve the powder. Adjust the volume to 100 ml 
with water. Stir and sterilize. 



Prehybridization Mix: 



Final 

Concentration 


Components 


Volume 
(per 100 ml) 


Stock 


50% 


Formamide 


50 ml 


100% 


5X 


SSC 


25 ml 


20X 


0.1% 


Sarkosyl 


0.5 ml 


20% 


0.02% 


SDS 


0.1 ml 


20% 


2% 


Blocking Reagent 


20 ml 


10% 




Water 


4.4 ml 
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General Procedures : 

1 . Place the blot in a heat-sealable plastic bag and add an appropriate volume of 
prehybridization solution (30 ml/1 00cm 2 ) at room temperature. Seal the bag with a heat 
sealer, avoiding bubbles as much as possible. Lay down the bags in a large plastic tray 
(one tray can accommodate at least 4-5 bags). Ensure that the bags are lying flat in the 
tray so that the prehybridization solution is evenly distributed throughout the bag. 
Incubate the blot for at least 2 hours with gentle agitation using a waver shaker. 

2. Denature DIG-labeled DNA probe by incubating for 10 min. at 98°C using the PCR 
machine and immediately cool it to 4°C. 

3. Add probe to prehybridization solution (25 ng/ml; 30 ml = 750 ng total probe) and mix 
well but avoid foaming. Bubbles may lead to background. 

4. Pour off the prehybridization solution from the hybridization bags and add new 
prehybridization and probe solution mixture to the bags containing the membrane. 

5. Incubate with gentle agitation for at least 16 hours. 

6. Proceed to medium stringency post-hybridization wash: 

Three times for 20 min. each with gentle agitation using IX SSC, 1% SDS at 60°C. 

All wash solutions must be prewarmed to 60 C. Use about 100 ml of wash solution per 
membrane. 

To avoid background keep the membranes fully submerged to avoid drying in spots; 
agitate sufficiently to avoid having membranes stick to one another. 




7. After the wash, proceed to immunological detection and CSPD development. 
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E. Procedure for Immunological Detection with CSPD 
Solutions: 



Buffer 1: 



Maleic acid buffer (0.1 M maleic acid, 0.15 M NaCl; 
adjusted to pH 7.5 with NaoH) 



Washing buffer: 
Blocking stock solution 



Maleic acid buffer with 0.3% (v/v) Tween 20. 

10% blocking reagent in buffer 1 . Dissolve (10X 
concentration): blocking reagent powder (Boehringer 
Mannheim, Indianapolis, IN, cat. no. 1096176) by 
constantly stirring on a 65°C heating block or heat in a 
microwave, autoclave and store at 4 C. 



Buffer 2 

(IX blocking solution): 



Dilute the stock solution 1:10 in Buffer 1. 



Detection buffer: 



0.1 M Tris, 0.1 M NaCl, pH 9.5 



Procedure : 

1 . After the post-hybridization wash the blots are briefly rinsed (1-5 min.) in the maleate 
washing buffer with gentle shaking. 

2. Then the membranes are incubated for 30 min. in Buffer 2 with gentle shaking. 

3. Anti-DIG-AP conjugate (Boehringer Mannheim, Indianapolis, IN, cat. no. 1093274) at 
75 mU/ml (1 : 10,000) in Buffer 2 is used for detection. 75 ml of solution can be used for 
3 blots. 

4. The membrane is incubated for 30 min. in the antibody solution with gentle shaking. 

5. The membrane are washed twice in washing buffer with gentle shaking. About 250 mis 
is used per wash for 3 blots. 

6. The blots are equilibrated for 2-5 min in 60 ml detection buffer. 
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7. Dilute CSPD (1 :200) in detection buffer. (This can be prepared ahead of time and stored 
in the dark at 4°C). 



The following steps must be done individually. Bags (one for detection and one for 
exposure) are generally cut and ready before doing the following steps. 

The blot is carefully removed from the detection buffer and excess liquid removed 
without drying the membrane. The blot is immediately placed in a bag and 1.5 ml of 
CSPD solution is added. The CSPD solution can be spread over the membrane. Bubbles 
present at the edge and on the surface of the blot are typically removed by gentle 
rubbing. The membrane is incubated for 5 min. in CSPD solution. 
Excess liquid is removed and the membrane is blotted briefly (DNA side up) on 
Whatman 3 MM paper. Do not let the membrane dry completely. 
Seal the damp membrane in a hybridization bag and incubate for 10 min at 37 C to 
enhance the luminescent reaction. 

Expose for 2 hours at room temperature to X-ray film. Multiple exposures can be taken. 
Luminescence continues for at least 24 hours and signal intensity increases during the 
first hours. 

EXAMPLE 3: MICRO ARRAY EXPERIMENTS AND RESULTS 

20 Example 3: MICRO ARRAY EXPERIMENTS AND RESULTS 
1. Sample Tissue Preparation 

(a) Roots 

Seeds of Arabidopsis thaliana ( Ws) were sterilized in full strength bleach for less than 5 
25 min., washed more than 3 times in sterile distilled deionized water and plated on MS agar plates. 
The plates were placed at 4°C for 3 nights and then placed vertically into a growth chamber 
having 16 hr light/8 hr dark cycles, 23 °C, 70% relative humidity and ~1 1,000 LUX. After 2 
weeks, the roots were cut from the agar, flash frozen in liquid nitrogen and stored at - 
80°C.(EXPT REP: 108439 and 108434) 
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(b) Root Hairless mutants 

Plants mutant at the rhl gene locus lack root hairs. This mutation is maintained as a 
5 heterozygote. 

Seeds of Arabidopsis thaliana (Landsberg erecta) mutated at the rhl gene locus were 
sterilized using 30% bleach with 1 ul/ml 20% Triton -X 100 and then vernalized at 4°C for 3 
days before being plated onto GM agar plates. Plates were placed in growth chamber with 1 6 hr 
light/8 hr. dark, 23°C, 14,500-15,900 LUX, and 70% relative humidity for germination and 
10 growth. 

After 7 days, seedlings were inspected for root hairs using a dissecting microscope. 
=-|3 Mutants were harvested and the cotyledons removed so that only root tissue remained. Tissue 
jjj was then flash frozen in liquid nitrogen and stored at -80C. (EXPT REP: 108433) 
Jj ! Arabidopsis thaliana (Landsberg erecta) seedlings grown and prepared as above were 

m used as controls. (EXPT REP: 108433) 

Alternatively, seeds of Arabidopsis thaliana (Landsberg erecta), heterozygous for the 
w rhll (root hairless) mutation, were surface-sterilized in 30% bleach containing 0.1% Triton X- 

flj 

:: p 100 and further rinsed in sterile water. They were then vernalized at 4° C for 4 days before being 
2 plated onto MS agar plates. The plates were maintained in a growth chamber at 24°C with 16 hr 
20 light/8 hr dark for germination and growth. After 1 0 days, seedling roots that expressed the 
phenotype (i.e. lacking root hairs) were cut below the hypocotyl junction, frozen in liquid 
nitrogen and stored at -80°C. Those seedlings with the normal root phenotype (heterozygous or 
wt) were collected as described for the mutant and used as controls. 

25 (c) Rosette Leaves, Stems, and Siliques Arabidopsis thaliana (Ws) seed was 

vernalized at4° C for 3 days before sowing in Metro-mix soil type 350. Flats were placed in a 
growth chamber having 16 hr light/8 hr dark, 80% relative humidity, 23°C and 13,000 LUX for 
germination and growth. After 3 weeks, rosette leaves, stems, and siliques (see EXPT REP: 
108436, 108437 and 108438) were harvested, flash frozen in liquid nitrogen and stored at -80°C 

30 until use. After 4 weeks, siliques (<5mm, 5-10 mm and >10 mm) were harvested, flash frozen in 



761 




PATENT 

Attorney Docket No. 2750-1481P 
liquid nitrogen and stored at -80°C until use. 5 week old whole plants (used as controls) were 
harvested, flash frozen in liquid nitrogen and kept at -80°C until RNA was isolated. 

(d) Trichomes 

5 Arabidopsis thaliana (Colombia glabrous) inflorescences were used as a control and 

CS8143 (hairy inflorescence ecotype) inflorescences, having increased trichomes, were used as 
the experimental sample. 

Approximately 10 \x\ of each type of seed was sown on a flat of 350 soil (containing 
0.03% marathon) and vernalized at 4° C for 3 days. Plants were then grown at room temperature 
10 under florescent lighting. Young inflorescences were collected at 30 days for the control plants 

^jf and 37 days for the experimental plants. Each inflorescence was cut into one-half inch (1/2") 

■-•3" 

sQ pieces, flash frozen in liquid nitrogen and stored at -80 C until RNA was isolated. 

W 
«1 

w 
m 

iljS (e) Germination 

Arabidopsis thaliana seeds (ecotype Ws) were sterilized in bleach and rinsed with sterile 
03 water. The seeds were placed in 100mm petri plates containing soaked autoclaved filter paper. 
: ,p Plates were foil-wrapped and left at 4 C for 3 nights to vernalize. After cold treatment, the foil 

was removed and plates were placed into a growth chamber having 16 hr light/8 hr dark cycles, 
20 23 °C, 70% relative humidity and ~1 1,000 lux. Seeds were collected 1 d (EXPT REP: 108461), 

2 d (EXPT REP: 108462), 3 d (EXPT REP: 108463) and 4 d (EXPT REP: 108464) later, flash 

frozen in liquid nitrogen and stored at -80°C until RNA was isolated. 

(f) Shoot Apical Meristem 

25 Arabidopsis thaliana (Landsberg erecta) plants mutant at the stm gene locus lack shoot 

meristems, produce aerial rosettes, have a reduced number of flowers per inflorescence, as well 
as a reduced number of petals, stamens and carpels, and is female sterile. This mutation is 
maintained as a heterozygote. 

Seeds of Arabidopsis thaliana (Landsberg erecta) mutated at the stm locus were sterilized 

30 using 30% bleach with 1 ul/ml 20% Triton -XI 00. The seeds were vernalized at 4°C for 3 days 
before being plated onto GM agar plates. Half were then put into a 22°C, 24 hr light growth 
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chamber and half in a 24°C 16 hr light/8 hr dark growth chamber having 14,500-15,900 LUX, 
and 70% relative humidity for germination and growth. 

After 7 days, seedlings were examined for leaf primordia using a dissecting microscope. 
Presence of leaf primordia indicated a wild type phenotype. Mutants were selected based on lack 
of leaf primordia. Mutants were then harvested and hypocotyls removed leaving only tissue in 
the shoot region. Tissue was then flash frozen in liquid nitrogen and stored at -80°C. 

Control tissue was isolated from 5 day old Landsberg erecta seedlings grown in the same 
manner as above. Tissue from the shoot region was harvested in the same manner as the stm 
tissue, but only contained material from the 24°C, 16 hr light/8 hr dark long day cycle growth 
chamber. (EXPTREP: 108453) 

Seeds of maize hybrid 35 A (Pioneer) were sown in water-moistened sand in flats (10 rows, 
5-6 seed/row) and covered with clear, plastic lids before being placed in a growth chamber 
having 16 hr light (25°C)/8 hr dark (20°C), 75% relative humidity and 13,000-14,000 LUX. 
Covered flats were watered every three days for 8 days. Seedlings were carefully removed from 
the sand and the outer layers of leaf shealth removed. About 2 mm sections were cut and flash 
frozen in liquid nitrogen prior to storage at -80°C. The tissues above the shoot apices (-1 cm 
long) were cut, treated as above and used as control tissue. 

(g) Abscissic acid (ABA) 

Seeds of Arabidopsis thaliana (ecotype Wassilewskija) were sown in trays and left at 4°C 
for 4 days to vernalize. They were then transferred to a growth chamber having grown 16 hr 
light/8 hr dark, 13,000 LUX, 70% humidity, and 20°C and watered twice a week with 1 L of IX 
Hoagland's solution. Approximately 1,000 14 day old plants were spayed with 200-250 mis of 
100 (xM ABA in a 0.02% solution of the detergent Silwet L-77. Whole seedlings, including 
roots, were harvested within a 15 to 20 minute time period at 1 hr and 6 hr after treatment, flash- 
frozen in liquid nitrogen and stored at -80°C. 

Seeds of maize hybrid 35 A (Pioneer) were sown in water-moistened sand in flats (10 
rows, 5-6 seed/row) and covered with clear, plastic lids before being placed in a growth chamber 
having 16 hr light (25°C)/8 hr dark (20°C), 75% relative humidity and 13,000-14,000 LUX. 
Covered flats were watered every three days for 7 days. Seedlings were carefully removed from 
the sand and placed in 1 -liter beakers with 100 \\M ABA for treatment. Control plants were 
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treated with water. After 6 hr and 24 hr, aerial and root tissues were separated and flash frozen in 
liquid nitrogen prior to storage at -80°C. 



(h) Auxin Responsive 

Seeds of Arabidopsis thaliana (ecotype Wassilewskija) were sown in trays and left at 4°C for 
4 days to vernalize. They were then transferred to a growth chamber having 1 6 hr light/8 hr 
dark, 13,000 LUX, 70% humidity, 20°C and watered twice a week with 1 L of IX Hoagland's 
solution (recipe recited in Feldmann et al., (1987) Mol. Gen. Genet. 208: 1-9 and described as 
complete nutrient solution). Approximately 1,000 14 day old plants were spayed with 200-250 
mis of 100 \\M NAA in a 0.02% solution of the detergent Silwet L-77. Aerial tissues (everything 
above the soil line) were harvested within a 1 5 to 20 minute time period 1 hr and 6 hrs after 
treatment, flash-frozen in liquid nitrogen and stored at -80°C. 

Seeds of maize hybrid 35 A (Pioneer) were sown in water-moistened sand in flats (10 rows, 
5-6 seed/row) and covered with clear, plastic lids before being placed in a growth chamber 
having 16 hr light (25°C)/8 hr dark (20°C), 75% relative humidity and 13,000-14,000 LUX. 
Covered flats were watered every three days for 7 days. Seedlings were carefully removed from 
the sand and placed in 1 -liter beakers with 100 \xM NAA for treatment. Control plants were 
treated with water. After 6 hr and 24 hr, aerial and root tissues were separated and flash frozen in 
liquid nitrogen prior to storage at -80°C. 

(i) Cytokinin 

Seeds of Arabidopsis thaliana (ecotype Wassilewskija) were sown in trays and left at 4°C 
for 4 days to vernalize. They were then transferred to a growth chamber having 1 6 hr light/8 hr 
dark, 13,000 LUX, 70% humidity, 20°C temperature and watered twice a week with 1 L of IX 
Hoagland's solution. Approximately 1,000 14 day old plants were spayed with 200-250 mis of 
100 (iM BA in a 0.02% solution of the detergent Silwet L-77. Aerial tissues (everything above 
the soil line) were harvested within a 15 to 20 minute time period 1 hr and 6 hrs after treatment, 
flash-frozen in liquid nitrogen and stored at -80°C. 
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Seeds of maize hybrid 35 A (Pioneer) were sown in water-moistened sand in flats (10 
rows, 5-6 seed/row) and covered with clear, plastic lids before being placed in a growth chamber 
having 16 hr light (25°C)/8 hr dark (20°C), 75% relative humidity and 13,000-14,000 LUX. 
Covered flats were watered every three days for 7 days. Seedlings were carefully removed from 
5 the sand and placed in 1 -liter beakers with 1 00 |iM B A for treatment. Control plants were treated 
with water. After 6 hr, aerial and root tissues were separated and flash frozen in liquid nitrogen 
prior to storage at -80°C. 

(j) Brassinosteroid Responsive 

1 0 Two separate experiments were performed, one with epi-brassinolide and one with the 

^ brassinosteroid biosynthetic inhibitor brassinazole. 

C- In the epi-brassinolide experiments, seeds of wild-type Arabidopsis thaliana (ecotype 

W 

gj 5 } Wassilewskija) and the brassinosteroid biosynthetic mutant dwf\-\ were sown in trays and left at 
jj ] 4°C for 4 days to vernalize. They were then transferred to a growth chamber having 1 6 hr light/8 
$5 hr dark, 1 1,000 LUX, 70% humidity and 22°C temperature. Four week old plants were spayed 
p with a 1 jaM solution of epi-brassinolide and shoot parts (unopened floral primordia and shoot 

apical meristems) harvested three hours later. Tissue was flash-frozen in liquid nitrogen and 
: p stored at -80°C.(EXPT REP 1 08480) 

In the brassinazole experiments, seeds of wild-type Arabidopsis thaliana (ecotype 
20 Wassilewskija) were grown as described above. Four week old plants were spayed with a 1 jjM 

solution of brassinazole and shoot parts (unopened floral primordia and shoot apical meristems) 

harvested three hours later. Tissue was flash-frozen in liquid nitrogen and stored at -80°C.(EXPT 

REP 108481) 

In addition to the spray experiments, tissue was prepared from two different mutants; (1) 
25 a dwfA-X knock out mutant (EXPT REP: 108478) and (2) a mutant overexpressing the dwf4-\ 
gene (EXPT REP: 108479). 

Seeds of wild-type Arabidopsis thaliana (ecotype Wassilewskija) and of the dwfA-\ 
knock out and overexpressor mutants were sown in trays and left at 4°C for 4 days to vernalize. 
They were then transferred to a growth chamber having 16 hr light/8 hr dark, 1 1,000 LUX, 70% 
30 humidity and 22°C temperature. Tissue from shoot parts (unopened floral primordia and shoot 
apical meristems) was flash-frozen in liquid nitrogen and stored at -80°C. 
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Another experiment was completed with seeds of Arabidopsis thaliana (ecotype 
Wassilewskija) were sown in trays and left at 4°C for 4 days to vernalize. They were then 
transferred to a growth chamber. Plants were grown under long-day (16 hr light: 8 hr. dark) 
conditions, 13,000 LUX light intensity, 70% humidity, 20°C temperature and watered twice a 
week with 1 L IX Hoagland's solution(recipe recited in Feldmann et al., (1987) Mol. Gen. 
Genet. 208: 1-9 and described as complete nutrient solution). Approximately 1,000 14 day old 
plants were spayed with 200-250 mis of 0.1 \xM Epi-Brassinolite in 0.02% solution of the 
detergent Silwet L-77. At 1 hr. and 6 hrs. after treatment aerial tissues were harvested within a 15 
to 20 minute time period and flash-frozen in liquid nitrogen. 



Seeds of maize hybrid 35 A (Pioneer) were sown in water-moistened sand in flats (10 
rows, 5-6 seed/row) and covered with clear, plastic lids before being placed in a growth chamber 
jjj having 16 hr light (25°C)/8 hr dark (20°C), 75% relative humidity and 13,000-14,000 LUX. 

Covered flats were watered every three days for 7 days. Seedlings were carefully removed from 
||5 the sand and placed in 1 -liter beakers with 0.1 jiM epi-brassinolide for treatment. Control plants 
q were treated with distilled deionized water. After 24 hr, aerial and root tissues were separated 
^ and flash frozen in liquid nitrogen prior to storage at -80°C. 



20 (k) Gibberillic Acid 

Seeds of Arabidopsis thaliana (ecotype Wassilewskija) were sown in trays and 
left at 4°C for 4 days to vernalize. They were then transferred to a growth chamber having 16 hr 
light/8 hr. dark, 13,000 LUX, 70% humidity, 20°C and watered twice a week with 1 L of IX 
Hoagland's solution. Approximately 1,000 14 day old plants were spayed with 200-250 mis of 
25 100 \xM gibberillic acid in a 0.02% solution of the detergent Silwet L-77. At 1 hr. and 6 hrs. after 
treatment, aerial tissues (everything above the soil line) were harvested within a 15 to 20 minute 
time period, flash-frozen in liquid nitrogen and stored at -80°C. 

Alternatively, seeds of Arabidopsis thaliana (ecotype Ws) were sown in Metro-mix soil 
type 350 and left at 4°C for 3 days to vernalize. They were then transferred to a growth chamber 
30 having 16 hr light/8 hr dark, 13,000 LUX, 80% humidity, 20°C temperature and watered every 
four days with 1 .5 L water. 14 days after germination, plants were sprayed with 100 jiM 
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gibberillic acid or with water. Aerial tissues were harvested 1 hr (EXPT REP: 108484), 6 hrs 
(EXPT REP: 108485), 12 hrs (EXPT REP: 108486), and 24 hrs post-treatment, flash frozen and 
stored at -80°C. 

Seeds of maize hybrid 35 A (Pioneer) were sown in water-moistened sand in flats (10 
rows, 5-6 seed/row) and covered with clear, plastic lids before being placed in a growth chamber 
having 16 hr light (25°C)/8 hr dark (20°C), 75% relative humidity and 13,000-14,000 LUX. 
Covered flats were watered every three days for 7 days. Seedlings were carefully removed from 
the sand and placed in 1 -liter beakers with 100 |uM gibberillic acid for treatment. Control plants 
were treated with water. After 1 hr, 6 hr and 12 hr, aerial and root tissues were separated and 
flash frozen in liquid nitrogen prior to storage at -80°C. 

(1) Nitrogen: High to Low 

Wild type Arabidopsis thaliana seeds (ecotpye Ws) were surface sterilized with 30% 
Clorox, 0.1% Triton X-100 for 5 minutes. Seeds were then rinsed with 4-5 exchanges of sterile 
double distilled deionized water. Seeds were vernalized at 4°C for 2-4 days in darkness. After 
cold treatment, seeds were plated on modified IX MS media (without NH4NO3 or KNCb), 0.5% 
sucrose, 0.5g/L MES pH5.7, 1% phytagar and supplemented with KNO3 to a final concentration 
of 60 mM (high nitrate modified IX MS media). Plates were then grown for 7 days in a Percival 
growth chamber at 22°C with 16 hr. light/8 hr dark. 

Germinated seedlings were then transferred to a sterile flask containing 50 mL of high 
nitrate modified IX MS liquid media. Seedlings were grown with mild shaking for 3 additional 
days at 22°C in 16 hr. light/8 hr dark (in a Percival growth chamber) on the high nitrate modified 
IX MS liquid media. 

After three days of growth on high nitrate modified IX MS liquid media, seedlings were 
transferred either to a new sterile flask containing 50 mL of high nitrate modified IX MS liquid 
media or to low nitrate modified IX MS liquid media (containing 20 DM KNO3). Seedlings 
were grown in these media conditions with mild shaking at 22°C in 16 hr light/ 8 hr dark for the 
appropriate time points and whole seedlings harvested for total RNA isolation via the Trizol 
method (LifeTech.). The time points used for the microarray experiments were 10 min. (EXPT 
REP: 108454) and 1 hour (EXPT REP: 108455) time points for both the high and low nitrate 
modified IX MS media. 
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Alternatively, seeds that were surface sterilized in 30% bleach containing 0.1% Triton X- 
100 and further rinsed in sterile water, were planted on MS agar, (0.5% sucrose) plates 
containing 50 mM KNO3 (potassium nitrate). The seedlings were grown under constant light 
(3500 LUX) at 22°C. After 12 days, seedlings were transferred to MS agar plates containing 
5 either ImM KNO3 or 50 mM KNO3. Seedlings transferred to agar plates containing 50 mM 
KNO3 were treated as controls in the experiment. Seedlings transferred to plates with ImM 
KNO3 were rinsed thoroughly with sterile MS solution containing 1 mM KNO3. There were ten 
plates per transfer. Root tissue was collected and frozen in 15 mL Falcon tubes at various time 
points which included 1 hour, 2 hours, 3 hours, 4 hours, 6 hours, 9 hours, 12 hours, 16 hours, and 
10 24 hours. 

O Maize 35A19 Pioneer hybrid seeds were sown on flats containing sand and grown in a 

Conviron growth chamber at 25°C, 16 hr light/8 hr dark, -13,000 LUX and 80% relative humidity. 
Plants were watered every three days with double distilled deionized water. Germinated seedlings 
W are allowed to grow for 10 days and were watered with high nitrate modified IX MS liquid media 
Jj^ (see above). On day 11, young corn seedlings were removed from the sand (with their roots intact) 
^ and rinsed briefly in high nitrate modified IX MS liquid media. The equivalent of half a flat of 
03 seedlings were then submerged (up to their roots) in a beaker containing either 500 mL of high or 
low nitrate modified IX MS liquid media (see above for details). 

At appropriate time points, seedlings were removed from their respective liquid media, 
20 the roots separated from the shoots and each tissue type flash frozen in liquid nitrogen and stored 
at -80°C. This was repeated for each time point. Total RNA was isolated using the Trizol method 
(see above) with root tissues only. 

Corn root tissues isolated at the 4 hr and 16 hr time points were used for the microarray 
experiments. Both the high and low nitrate modified IX MS media were used. 

25 

(m) Nitrogen: Low to High 

Arabidopsis thaliana ecotype Ws seeds were sown on flats containing 4 L of a 1 :2 
mixture of Grace Zonolite vermiculite and soil. Flats were watered with 3 L of water and 
vernalized at 4°C for five days. Flats were placed in a Conviron growth chamber having 1 6 hr 
30 light/8 hr dark at 20°C, 80% humidity and 17,450 LUX. Flats were watered with approximately 
1.5 L of water every four days. Mature, bolting plants (24 days after germination) were bottom 
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treated with 2 L of either a control (100 mM mannitol pH 5.5) or an experimental (50 mM 
ammonium nitrate, pH 5.5) solution. Roots, leaves and siliques were harvested separately 30, 
120 and 240 minutes after treatment, flash frozen in liquid nitrogen and stored at -80°C. 

Hybrid maize seed (Pioneer hybrid 35A19) were aerated overnight in deionized water. 
Thirty seeds were plated in each flat, which contained 4 liters of Grace zonolite vermiculite. Two 
liters of water were bottom fed and flats were kept in a Conviron growth chamber with 16 hr 
light/8 hr dark at 20°C and 80% humidity. Flats were watered with 1 L of tap water every three 
days. Five day old seedlings were treated as described above with 2 L of either a control (100 
mM mannitol pH 6.5) solution or 1 L of an experimental (50 mM ammonium nitrate, pH 6.8) 
solution. Fifteen shoots per time point per treatment were harvested 10, 90 and 180 minutes after 
treatment, flash frozen in liquid nitrogen and stored at -80°C. 

Alternatively, seeds of Arabidopsis thaliana (ecotype Wassilewskija) were left at 4°C for 
3 days to vernalize. They were then sown on vermiculite in a growth chamber having 1 6 hours 
light/8 hours dark, 12,000-14,000 LUX, 70% humidity, and 20°C. They were bottom-watered 
with tap water, twice weekly. Twenty-four days old plants were sprayed with either water 
(control) or 0.6% ammonium nitrate at 4 jaL/cm 2 of tray surface. Total shoots and some primary 
roots were cleaned of vermiculite, flash-frozen in liquid nitrogen and stored at -80°C. 

(n) Methyl Jasmonate 

Seeds of Arabidopsis thaliana (ecotype Wassilewskija) were sown in trays and left at 4°C 
for 4 days to vernalize before being transferred to a growth chamber having 1 6 hr light/8 hr. 
dark, 13,000 LUX, 70% humidity, 20°C temperature and watered twice a week with 1 L of a IX 
Hoagland's solution. Approximately 1,000 14 day old plants were spayed with 200-250 mis of 
0.001% methyl jasmonate in a 0.02% solution of the detergent Silwet L-77. At 1 hr and 6 hrs 
after treatment, whole seedlings, including roots, were harvested within a 1 5 to 20 minute time 
period, flash-frozen in liquid nitrogen and stored at -80°C. 

Seeds of maize hybrid 35 A (Pioneer) were sown in water-moistened sand in flats (10 
rows, 5-6 seed/row) and covered with clear, plastic lids before being placed in a growth chamber 
having 16 hr light (25°C)/8 hr dark (20°C), 75% relative humidity and 13,000-14,000 LUX. 
Covered flats were watered every three days for 7 days. Seedlings were carefully removed from 
the sand and placed in 1 -liter beakers with 0.001% methyl jasmonate for treatment. Control 
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plants were treated with water. After 24 hr, aerial and root tissues were separated and flash 
frozen in liquid nitrogen prior to storage at -80°C. 

(o) Salicylic Acid 

5 Seeds of Arabidopsis thaliana (ecotype Wassilewskija) were sown in trays and left at 4°C 

for 4 days to vernalize before being transferred to a growth chamber having 1 6 hr light/8 hr. 
dark, 13,000 LUX, 70% humidity, 20°C temperature and watered twice a week with 1 L of a IX 
Hoagland's solution. Approximately 1,000 14 day old plants were spayed with 200-250 mis of 5 
mM salicylic acid (solubilized in 70% ethanol) in a 0.02% solution of the detergent Silwet L-77. 
10 At 1 hr and 6 hrs after treatment, whole seedlings, including roots, were harvested within a 1 5 to 
O 20 minute time period flash-frozen in liquid nitrogen and stored at -80°C. 
-sjp Alternatively, seeds of wild-type Arabidopsis thaliana (ecotype Columbia) and mutant 

^ C S3 726 were sown in soil type 200 mixed with osmocote fertilizer and Marathon insecticide and 
fl"! left at 4°C for 3 days to vernalize. Flats were incubated at room temperature with continuous 
j|Jj light. Sixteen days post germination plants were sprayed with 2 mM SA, 0.02% SilwettL-77 or 
control solution (0.02% SilwettL-77. Aerial parts or flowers were harvested 1 hr (EXPT REP: 

m 108471 and 108472), 4 hr (EXPT REP: 108469 and 108470), 6 hr (EXPT REP: 108440,) 24 hr 

111 

£ (EXPT REP: 108443, 107953 and 107960) and 3 weeks (EXPT REP: 108475, 108476) post- 

sis*?, 

treatment flash frozen and stored at -80 C. 

20 Seeds of maize hybrid 35 A (Pioneer) were sown in water-moistened sand in flats (10 

rows, 5-6 seed/row) and covered with clear, plastic lids before being placed in a growth chamber 
having 16 hr light (25°C)/8 hr dark (20°C), 75% relative humidity and 13,000-14,000 LUX. 
Covered flats were watered every three days for 7 days. Seedlings were carefully removed from 
the sand and placed in 1 -liter beakers with 2 mM SA for treatment. Control plants were treated 

25 with water. After 12 hr and 24 hr, aerial and root tissues were separated and flash frozen in liquid 
nitrogen prior to storage at -80°C. 

(p) Wounding 

Seeds of Arabidopsis thaliana (Wassilewskija) were sown in trays and left at 4°C for 
30 three days to vernalize before being transferred to a growth chamber having 16 hr light/8 hr dark, 
12,000-14,000 LUX, 70% humidity and 20°C. After 14 days, the leaves were wounded with 
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forceps. Aerial tissues were harvested 1 hour and 6 hours after wounding. Aerial tissues from 
unwounded plants served as controls. Tissues were flash-frozen in liquid nitrogen and stored at 
-80°C 

Seeds of maize hybrid 35 A (Pioneer) were sown in water-moistened sand in flats (10 
5 rows, 5-6 seed/row) and covered with clear, plastic lids before being placed in a growth chamber 
having 16 hr light (25°C)/8 hr dark (20°C), 75% relative humidity and 13,000-14,000 LUX. 
Covered flats were watered every three days for 7 days. Seedlings were wounded (one leaf 
nicked by scissors) and placed in 1 -liter beakers of water for treatment. Control plants were 
treated not wounded. After 1 hr and 6 hr aerial and root tissues were separated and flash frozen 
10 in liquid nitrogen prior to storage at -80°C. 



i'jS (q) Drought stress 

^ Seeds of Arabidopsis thaliana (Wassilewskija) were sown in pots and left at 4°C for 



three days to vernalize before being transferred to a growth chamber having 16 hr light/8 hr dark, 
Ijfj 1 50,000-160,000 LUX, 20°C and 70% humidity. After 14 days, aerial tissues were cut and left to 
' m dry on 3 MM Whatman paper in a petri-plate for 1 hour and 6 hours. Aerial tissues exposed for 1 

hour and 6 hours to 3 MM Whatman paper wetted with IX Hoagland's solution served as 
;: p controls. Tissues were harvested, flash-frozen in liquid nitrogen and stored at -80°C. 
J*! 5 Alternatively, Arabidopsis thaliana (Ws) seed was vernalized at 4° C for 3 days before 

20 sowing in Metromix soil type 350. Flats were placed in a growth chamber with 23°C, 16 hr 

light/8 hr. dark, 80% relative humidity, -13,000 LUX for germination and growth. Plants were 
watered with 1-1.5 L of water every four days. Watering was stopped 16 days after germination 
for the treated samples, but continued for the control samples. Rosette leaves and stems (EXPT 
REP 108477, 108482 and 108483), flowers (see EXPT REP: 108473, 108474) and siliques were 
25 harvested 2 d, 3 d, 4 d, 5 d, 6 d and 7 d (EXPT REP: 108473) after watering was stopped. Tissue 
was flash frozen in liquid nitrogen and kept at -80 °C until RNA was isolated. Flowers and 
siliques were also harvested on day 8 from plants that had undergone a 7 d drought treatment 
followed by 1 day of watering (EXPT REP: 108474). Control plants (whole plants) were 
harvested after 5 weeks, flash frozen in liquid nitrogen and stored as above. 
30 Seeds of maize hybrid 35A (Pioneer) were sown in water-moistened sand in flats (10 

rows, 5-6 seed/row) and covered with clear, plastic lids before being placed in a growth chamber 
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having 16 hr light (25°C)/8 hr dark (20°C), 75% relative humidity and 13,000-14,000 LUX. 
Covered flats were watered every three days for 7 days. Seedlings were carefully removed from 
the sand and placed in empty 1 -liter beakers at room temperature for treatment. Control plants 
were placed in water. After 1 hr, 6 hr, 12 hr and 24 hr aerial and root tissues were separated and 
5 flash frozen in liquid nitrogen prior to storage at -80°C. 

(r) Osmotic stress 

Seeds of Arabidopsis thaliana (Wassilewskija) were sown in trays and left at 4°C for 
three days to vernalize before being transferred to a growth chamber having 16 hr light/8 hr dark, 
10 12,000-14,000 LUX, 20°C, and 70% humidity. After 14 days, the aerial tissues were cut and 
O placed on 3 MM Whatman paper in a petri-plate wetted with 20% PEG (polyethylene glycol-M r 
==|3 8,000) in IX Hoagland's solution. Aerial tissues on 3 MM Whatman paper containing IX 

it! 

ySj Hoagland's solution alone served as the control. Aerial tissues were harvested at 1 hour and 6 
jjjjj I hours after treatment, flash-frozen in liquid nitrogen and stored at -80°C. 
IjS Seeds of maize hybrid 35 A (Pioneer) were sown in water-moistened sand in flats (10 

^ rows, 5-6 seed/row) and covered with clear, plastic lids before being placed in a growth chamber 
05 having 1 6 hr light (25°C)/8 hr dark (20°C), 75% relative humidity and 1 3,000-14,000 LUX. 

Covered flats were watered every three days for 7 days. Seedlings were carefully removed from 
the sand and placed in 1 -liter beakers with 20% PEG (polyethylene glycol-M r 8,000) for 
20 treatment. Control plants were treated with water. After 1 hr and 6 hr aerial and root tissues were 
separated and flash frozen in liquid nitrogen prior to storage at -80°C. 

Seeds of maize hybrid 35A (Pioneer) were sown in water-moistened sand in flats (10 
rows, 5-6 seed/row) and covered with clear, plastic lids before being placed in a growth chamber 
having 16 hr light (25°C)/8 hr dark (20°C), 75% relative humidity and 13,000-14,000 LUX. 
25 Covered flats were watered every three days for 7 days. Seedlings were carefully removed from 
the sand and placed in 1 -liter beakers with 150mM NaCl for treatment. Control plants were 
treated with water. After 1 hr, 6hr, and 24 hr aerial and root tissues were separated and flash 
frozen in liquid nitrogen prior to storage at -80°C. 
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(s) Heat Shock Treatment 

Seeds of Arabidopsis thaliana (Wassilewskija) were sown in trays and left at 4°C for 
three days to vernalize before being transferred to a growth chamber with 16 hr light/8 hr dark, 
12,000-14,000 LUX, 70% humidity and 20°C. Fourteen day old plants were transferred to a 42°C 
5 growth chamber and aerial tissues were harvested 1 h and 6 h after transfer. Control plants were 
left at 20°C and aerial tissues were harvested. Tissues were flashfrozen in liquid nitrogen and 
stored at -80°C. 

Seeds of maize hybrid 35 A (Pioneer) were sown in water-moistened sand in flats (10 
rows, 5-6 seed/row) and covered with clear, plastic lids before being placed in a growth chamber 
10 having 16 hr light (25°C)/8 hr dark (20°C), 75% relative humidity and 13,000-14,000 LUX. 
! P Covered flats were watered every three days for 7 days. Seedlings were carefully removed from 
:|3 the sand and placed in 1 -liter beakers containing 42°C water for treatment. Control plants were 
pjj treated with water at 25°C. After 1 hr and 6 hr aerial and root tissues were separated and flash 
$1 frozen in liquid nitrogen prior to storage at -80°C. 

m 

L (t) Cold Shock Treatment 

;Jj Seeds of Arabidopsis thaliana (Wassilewskija) were sown in trays and left at 4°C for 

three days to vernalize before being transferred to a growth chamber having 16 hr light/8 hr dark, 
12,000-14,000 LUX, 20°C and 70% humidity. Fourteen day old plants were transferred to a 4°C 

20 dark growth chamber and aerial tissues were harvested 1 hour and 6 hours later. Control plants 
were maintained at 20°C and covered with foil to avoid exposure to light. Tissues were flash- 
frozen in liquid nitrogen and stored at -80°C. 

Seeds of maize hybrid 35A (Pioneer) were sown in water-moistened sand in flats (10 
rows, 5-6 seed/row) and covered with clear, plastic lids before being placed in a growth chamber 

25 having 16 hr light (25°C)/8 hr dark (20°C), 75% relative humidity and 13,000-14,000 LUX. 

Covered flats were watered every three days for 7 days. Seedlings were carefully removed from 
the sand and placed in 1 -liter beakers containing 4°C water for treatment. Control plants were 
treated with water at 25°C. After 1 hr and 6 hr aerial and root tissues were separated and flash 
frozen in liquid nitrogen prior to storage at -80°C. 

30 



773 




PATENT 

Attorney Docket No. 2750-1481P 
(u) Oxidative Stress- Hydrogen Peroxide Treatment 

Seeds of Arabidopsis thaliana (Wassilewskija) were sown in trays and left at 4°C for 
three days to vernalize, before being transferred to a growth chamber having 16 hr light/8 hr 
dark, 12,000- 14,000 LUX, 20°C and 70% humidity. Fourteen day old plants were sprayed with 5 
5 mM H 2 0 2 (hydrogen peroxide) in a 0.02% Silwett L-77 solution. Control plants were sprayed 
with a 0.02% Silwett L-77 solution. Aerial tissues were harvested 1 hour and 6 hours after 
spraying, flash-frozen in liquid nitrogen and stored at ~80°C. 

Seeds of maize hybrid 35 A (Pioneer) were sown in water-moistened sand in flats (10 
rows, 5-6 seed/row) and covered with clear, plastic lids before being placed in a growth chamber 
10 having 16 hr light (25°C)/8 hr dark (20°C), 75% relative humidity and 13,000-14,000 LUX. 
O Covered flats were watered every three days for 7 days. Seedlings were carefully removed from 
■ 5 the sand and placed in 1 -liter beakers with 5 mM H 2 0 2 for treatment. Control plants were treated 
with water. After 1 hr, 6 hr and 24 hr, aerial and root tissues were separated and flash frozen in 
liquid nitrogen prior to storage at -80°C. 

ry 

m 

(v) Nitric Oxide Treatment 

W Seeds of Arabidopsis thaliana (Wassilewskija) were sown in trays and left at 4°C for 

ru 

^ three days to vernalize before being transferred to a growth chamber having 16 hr light/8 hr dark, 
H 12,000-14,000 LUX, 20°C and 70% humidity. Fourteen day old plants were sprayed with 5 mM 
20 sodium nitroprusside in a 0.02% Silwett L-77 solution. Control plants were sprayed with a 

0.02% Silwett L-77solution. Aerial tissues were harvested 1 hour and 6 hours after spraying, 

flash-frozen in liquid nitrogen and stored at -80°C. 

Seeds of maize hybrid 35 A (Pioneer) were sown in water-moistened sand in flats (10 

rows, 5-6 seed/row) and covered with clear, plastic lids before being placed in a growth chamber 
25 having 16 hr light (25°C)/8 hr dark (20°C), 75% relative humidity and 13,000-14,000 LUX. 

Covered flats were watered every three days for 7 days. Seedlings were carefully removed from 

the sand and placed in 1 -liter beakers with 5 mM nitroprusside for treatment. Control plants were 

treated with water. After 1 hr, 6 hr and 12 hr, aerial and root tissues were separated and flash 

frozen in liquid nitrogen prior to storage at -80°C. 

30 

(w) S4 Immature Buds, Inflorescence Meristem 
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Seeds of Arabidopsis thaliana (ecotype Wassilewskija) were sown in pots and left at 4°C 
for two to three days to vernalize. They were then transferred to a growth chamber. Plants were 
grown under long-day (16 hr light: 8 hr dark) conditions, 7000-8000 LUX light intensity, 70% 
humidity, and 22°C temperature. Inflorescences containing immature floral buds [stages 1-12; 
5 Smyth et al., 1990] as well as the inflorescence meristem were harvested and flash frozen in 
liquid nitrogen. 

(x) S5 Flowers (Opened) 

Seeds of Arabidopsis thaliana (ecotype Wassilewskija) were sown in pots and left at 4°C 
10 for two to three days to vernalize. They were then transferred to a growth chamber. Plants were 
O grown under long-day (16 hr light: 8 hr dark) conditions, 7000-8000 LUX light intensity, 70% 
,J humidity, and 22°C temperature. Mature, unpollinated flowers [stages 12-14; Smyth et al. 1990] 
were harvested and flash frozen in liquid nitrogen. 

y 

(y) S6 Siliques (All Stages) 

CS Seeds of Arabidopsis thaliana (ecotype Wassilewskija) were sown in pots and left at 4°C 

till 

p for two to three days to vernalize. They were then transferred to a growth chamber. Plants were 
[3 grown under long-day (16 hr light: 8 hr dark) conditions, 7000-8000 LUX light intensity, 70% 
20 humidity, and 22°C temperature. Siliques bearing developing seeds containing post fertilization 
through pre-heart stage [0-72 hours after fertilization (HAF)], heart- through early curled 
cotyledon stage [72-120 HAF] and late-curled cotyledon stage [>120 HAF] embryos were 
harvested separately and pooled prior to RNA isolation in a mass ratio of 1 : 1 : 1 . The tissues were 
then flash frozen in liquid nitrogen. Description of the stages of Arabidopsis embryogenesis used 
25 were reviewed by Bowman (1 994). 

(z) ARABIDOPSIS ENDOSPERM 
mea/mea Fruits 0-10 mm 

Seeds of Arabidopsis thaliana heterozygous for the fertilization- independent 
30 endosperm! ifiel) [Ohad et al., 1996; ecotype Landsberg erecta (Ler)] were sown in pots and 

left at 4°C for two to three days to vernalize. Kiyosue et al. (1999) subsequently determined that 
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fiel was allelic to the gametophytic maternal effect mutant medea (Grossniklaus et al., 1998). 
Imbibed seeds were then transferred to a growth chamber. Plants were grown under long-day 
(16 hr light: 8 hr dark) conditions, 7000-8000 LUX light intensity, 70% humidity, and 22°C 
temperature. 1-2 siliques (fruits) bearing developing seeds just prior to dessication [9 days after 

5 flowering (DAF)] were selected from each plant and were hand-dissected to identify wild-type, 
mea/+ heterozygotes, and mea/mea homozygous mutant plants. At this stage, homozygous 
mea/mea plants produce short siliques that contain >70% aborted seed and can be distinguished 
from those produced by wild-type (100% viable seed) and mea/+ heterozygous (50% viable 
seed) plants (Ohad et al., 1996; Grossniklaus et al., 1998; Kiyosue et al., 1999). Siliques 0-10 
10 mm in length containing developing seeds 0-9 DAF produced by homozygous mea/mea plants 

O were harvested and flash frozen in liquid nitrogen. 



Pods 0-10 mm (Control Tissue for Sample 70) 



i4J 

Ml 

^ Seeds of Arabidopsis thaliana heterozygous for the fertilization- independent 

nlj 

endosperm! (fiel) [Ohad et al., 1996; ecotype Landsberg erecta (Ler)] were sown in pots and 



left at 4°C for two to three days to vernalize. Kiyosue et al. (1999) subsequently determined that 



CS fiel was allelic to the gametophytic maternal effect mutant medea (Grossniklaus et al., 1998). 

11J 

Imbibed seeds were then transferred to a growth chamber. Plants were grown under long-day 
P (16 hr light: 8 hr dark) conditions, 7000-8000 LUX light intensity, 70% humidity, and 22°C 
20 temperature. 1-2 siliques (fruits) bearing developing seeds just prior to dessication [9 days after 
flowering (DAF)] were selected from each plant and were hand-dissected to identify wild-type, 
mea/+ heterozygotes, and mea/mea homozygous mutant plants. At this stage, homozygous 
mea/mea plants produce short siliques that contain >70% aborted seed and can be distinguished 
from those produced by wild-type (100% viable seed) and mea/+ heterozygous (50% viable 
25 seed) plants (Ohad et al., 1996; Grossniklaus et al., 1998; Kiyosue et al., 1999). Siliques 0-10 
mm in length containing developing seeds 0-9 DAF produced by segregating wild-type plants 
were opened and the seeds removed. The remaining tissues (pods minus seed) were harvested 
and flash frozen in liquid nitrogen. 



30 (aa) ARABIDOPSIS SEEDS 



776 



PATENT 

Attorney Docket No. 2750-1481P 



Fruits (pod + seed) 0-5 mm 

Seeds of Arabidopsis thaliana (ecotype Wassilewskija) were sown in pots and left at 4°C 
for two to three days to vernalize. They were then transferred to a growth chamber. Plants were 
grown under long-day (16 hr light: 8 hr dark) conditions, 7000-8000 LUX light intensity, 70% 

5 humidity, and 22°C temperature. 3-4 siliques (fruits) bearing developing seeds were selected 
from at least 3 plants and were hand-dissected to determine what developmental stage(s) were 
represented by the enclosed embryos. Description of the stages of Arabidopsis embryogenesis 
used in this determination were summarized by Bowman (1994). Silique lengths were then 
determined and used as an approximate determinant for embryonic stage. Siliques 0-5 mm in 
10 length containing post fertilization through pre-heart stage [0-72 hours after fertilization (HAF)] 

O embryos were harvested and flash frozen in liquid nitrogen. 



Fruits(pod + seed) 5-10 mm 



m 

W Seeds of Arabidopsis thaliana (ecotype Wassilewskija) were sown in pots and left at 4°C 

%pl for two to three days to vernalize. They were then transferred to a growth chamber. Plants were 

grown under long-day (16 hr light: 8 hr dark) conditions, 7000-8000 LUX light intensity, 70% 
03 humidity, and 22°C temperature. 3-4 siliques (fruits) bearing developing seeds were selected 

ru 

,^ from at least 3 plants and were hand-dissected to determine what developmental stage(s) were 
jW represented by the enclosed embryos. Description of the stages of Arabidopsis embryogenesis 
20 used in this determination were summarized by Bowman (1994). Silique lengths were then 

determined and used as an approximate determinant for embryonic stage. Siliques 5-10 mm in 
length containing heart- through early upturned-U- stage [72-120 hours after fertilization (HAF)] 
embryos were harvested and flash frozen in liquid nitrogen. 

25 

Fruits(pod + seed) >10 mm 

Seeds of Arabidopsis thaliana (ecotype Wassilewskija) were sown in pots and left at 4°C 
for two to three days to vernalize. They were then transferred to a growth chamber. Plants were 
grown under long-day (16 hr light: 8 hr dark) conditions, 7000-8000 LUX light intensity, 70% 
30 humidity, and 22°C temperature. 3-4 siliques (fruits) bearing developing seeds were selected 
from at least 3 plants and were hand-dissected to determine what developmental stage(s) were 
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represented by the enclosed embryos. Description of the stages of Arabidopsis embryogenesis 
used in this determination were summarized by Bowman (1994). Silique lengths were then 
determined and used as an approximate determinant for embryonic stage. Siliques >10 mm in 
length containing green, late upturned-U- stage [>120 hours after fertilization (HAF)-9 days after 
flowering (DAF)] embryos were harvested and flash frozen in liquid nitrogen. 



Green Pods 5-10 mm (Control Tissue for Samples 72-74) 

Seeds of Arabidopsis thaliana (ecotype Wassilewskija) were sown in pots and left at 4°C 
1 0 for two to three days to vernalize. They were then transferred to a growth chamber. Plants were 
grown under long-day (16 hr light: 8 hr dark) conditions, 7000-8000 LUX light intensity, 70% 
humidity, and 22°C temperature. 3-4 siliques (fruits) bearing developing seeds were selected 
from at least 3 plants and were hand-dissected to determine what developmental stage(s) were 
represented by the enclosed embryos. Description of the stages of Arabidopsis embryogenesis 



m 

in 

lift used in this determination were summarized by Bowman (1994). Silique lengths were then 



20 



determined and used as an approximate determinant for embryonic stage. Green siliques 5-10 



03 mm in length containing developing seeds 72-120 hours after fertilization (HAF)] were opened 
s jg and the seeds removed. The remaining tissues (green pods minus seed) were harvested and flash 



frozen in liquid nitrogen. 



Green Seeds from Fruits >10 mm 

Seeds of Arabidopsis thaliana (ecotype Wassilewskija) were sown in pots and left at 4°C 
for two to three days to vernalize. They were then transferred to a growth chamber. Plants were 

25 grown under long-day (16 hr light: 8 hr dark) conditions, 7000-8000 LUX light intensity, 70% 
humidity, and 22°C temperature. 3-4 siliques (fruits) bearing developing seeds were selected 
from at least 3 plants and were hand-dissected to determine what developmental stage(s) were 
represented by the enclosed embryos. Description of the stages of Arabidopsis embryogenesis 
used in this determination were summarized by Bowman (1994). Silique lengths were then 

30 determined and used as an approximate determinant for embryonic stage. Green siliques >10 
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mm in length containing developing seeds up to 9 days after flowering (DAF)] were opened and 
the seeds removed and harvested and flash frozen in liquid nitrogen. 



5 Brown Seeds from Fruits >10 mm 

Seeds of Arabidopsis thaliana (ecotype Wassilewskija) were sown in pots and left at 4°C 
for two to three days to vernalize. They were then transferred to a growth chamber. Plants were 
grown under long-day (16 hr light: 8 hr dark) conditions, 7000-8000 LUX light intensity, 70% 
humidity, and 22°C temperature. 3-4 siliques (fruits) bearing developing seeds were selected 
1 0 from at least 3 plants and were hand-dissected to determine what developmental stage(s) were 
O represented by the enclosed embryos. Description of the stages of Arabidopsis embryogenesis 
=Jj used in this determination were summarized by Bowman (1994). Silique lengths were then 
j; p determined and used as an approximate determinant for embryonic stage. Yellowing siliques 
m >io mm in length containing brown, dessicating seeds > 1 1 days after flowering (DAF)] were 
ijfj opened and the seeds removed and harvested and flash frozen in liquid nitrogen. 

IS 

ru 

r Green/Brow n Seeds from Fruits >10 mm 

. 

^ Seeds of Arabidopsis thaliana (ecotype Wassilewskija) were sown in pots and left at 4°C 

20 for two to three days to vernalize. They were then transferred to a growth chamber. Plants were 
grown under long-day (16 hr light: 8 hr dark) conditions, 7000-8000 LUX light intensity, 70% 
humidity, and 22°C temperature. 3-4 siliques (fruits) bearing developing seeds were selected 
from at least 3 plants and were hand-dissected to determine what developmental stage(s) were 
represented by the enclosed embryos. Description of the stages of Arabidopsis embryogenesis 
25 used in this determination were summarized by Bowman (1994). Silique lengths were then 
determined and used as an approximate determinant for embryonic stage. Green siliques >10 
mm in length containing both green and brown seeds >9 days after flowering (DAF)] were 
opened and the seeds removed and harvested and flash frozen in liquid nitrogen. 

30 
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Mature Seeds (24 hours after imbibition) 

Mature dry seeds of Arabidopsis thaliana (ecotype Wassilewskija) were sown onto 
moistened filter paper and left at 4°C for two to three days to vernalize. Imbibed seeds were then 
transferred to a growth chamber [16 hr light: 8 hr dark conditions, 7000-8000 LUX light 
intensity, 70% humidity, and 22°C temperature], the emerging seedlings harvested after 48 hours 
and flash frozen in liquid nitrogen. 

Mature Seeds (Dry) 

Seeds of Arabidopsis thaliana (ecotype Wassilewskija) were sown in pots and left at 4°C 
for two to three days to vernalize. They were then transferred to a growth chamber. Plants were 
grown under long-day (16 hr light: 8 hr dark) conditions, 7000-8000 LUX light intensity, 70% 
humidity, and 22°C temperature and taken to maturity. Mature dry seeds are collected, dried for 
one week at 28°C, and vernalized for one week at 4°C before used as a source of RNA. 

Ovules 

Seeds of Arabidopsis thaliana heterozygous for pistillata (pi) [ecotype Landsberg erecta 
(Ler)] were sown in pots and left at 4°C for two to three days to vernalize. They were then 
transferred to a growth chamber. Plants were grown under long-day (16 hr light: 8 hr dark) 
conditions, 7000-8000 LUX light intensity, 76% humidity, and 24°C temperature. 
Inflorescences were harvested from seedlings about 40 days old. The inflorescences were cut 
into small pieces and incubated in the following enzyme solution (pH 5) at room temperature for 
0.5-1 hr.: 0.2% pectolyase Y-23, 0.04% pectinase, 5 mM MES, 3% Sucrose and MS salts (1900 
mg/1 KN0 3 , 1650 mg/1 NH4NO3, 370 mg/1 MgS0 4 • 7 H 2 0, 170 mg/1 KH 2 P0 4 , 440 mg/1 CaCl 2 • 
2 H 2 0, 6.2 mg/1 H2BO3, 15.6 mg/1 MnS0 4 • 4 H 2 0, 8.6 mg/1 ZnS0 4 • 7 H 2 0, 0.25 mg/1 NaMo0 4 

• 2 H 2 0, 0.025 mg/1 CuC0 4 • 5 H 2 0, 0.025 mg/1 CoCl 2 • 6 H 2 0, 0.83 mg/1 KI, 27.8 mg/1 FeS0 4 

• 7 H 2 0, 37.3 mg/1 Disodium EDTA, pH 5.8). At the end of the incubation the mixture of 
inflorescence material and enzyme solution was passed through a size 60 sieve and then through 
a sieve with a pore size of 125 |um. Ovules greater than 125 |nm in diameter were collected, 
rinsed twice in B5 liquid medium (2500 mg/1 KNO 3 ,250 mg/1 MgS0 4 • 7 H 2 0, 150 mg/1 
NaH2P04 • H 2 0, 150 mg/1 CaCl 2 • 2 H 2 0, 134 mg/1 (NH4)2 CaCl 2 • S0 4 , 3 mg/1 H 2 B0 3 , 10 
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mg/1 MnS0 4 • 4 H 2 0, 2 ZnS0 4 • 7 H 2 0, 0.25 mg/1 NaMo0 4 • 2 H 2 0, 0.025 mg/1 CuC0 4 • 5 
H 2 0, 0.025 mg/1 CoCl 2 • 6 H 2 0, 0.75 mg/1 KI, 40 mg/1 EDTA sodium ferric salt, 20 g/1 sucrose, 
10 mg/1 Thiamine hydrochloride, 1 mg/1 Pyridoxine hydrochloride, 1 mg/1 Nicotinic acid, 100 
mg/1 myo-inositol, pH 5.5)), rinsed once in deionized water and flash frozen in liquid nitrogen. 
5 The supernatant from the 125 |nm sieving was passed through subsequent sieves of 50 jam and 32 
jam. The tissue retained in the 32 jam sieve was collected and mRNA prepared for use as a 
control. 

10 (bb) Herbicide treament 

Arabidopsis thaliana (Ws) seeds were sterilized for 5 min. with 30% bleach, 50 |il Triton 

; 0 in a total volume of 50 ml. Seeds were vernalized at 4°C for 3 days before being plated onto GM 

W 

Hjl agar plates at a density of about 144 seeds per plate. Plates were incubated in a Percival growth 

jjj chamber having 16 hr light/8 hr dark, 80% relative humidity, 22 °C and 1 1,000 LUX for 14 days. 
M Plates were sprayed (-0.5 mis/plate) with water, Finale (1 . 128 g/L), Glean (1 .88 g/L), 

i|g RoundUp (0.01 g/L) or Trimec (0.08 g/L). Tissue was collected and flash frozen in liquid 

J! nitrogen at the following time points: 0, 1, 2, 4 (EXPT REP: 107871 (Finale), 107881 (Glean), 

£ 107896 (Round-up) and 107886 (Trimec)), 8, 12(EXPT REP: 108467 (Finale), 108468 (Glean), 

C 108465 (Round-up) and 108466, 107891 (Trimec)), and 24 hours. Frozen tissue was stored at - 

20 80°C prior to RNA isolation. 

(cc) Ap2 

Seeds of Arabidopsis thaliana (ecotype Landesberg erecta) and floral mutant apetalal 
(Jofuku et al., 1994, Plant Cell 6:121 1-1225) were sown in pots and left at 4°C for two to three 
25 days to vernalize. They were then transferred to a growth chamber. Plants were grown under 

long-day (16 hr light, 8 hr dark) conditions 7000-8000 LUX light intensity, 70% humidity and 22 
°C temperature. Inflorescences containing immature floral buds (stages 1-7; Bowman, 1994) as 
wel as the inflorescence meristem were harvested and flashfrozen. Polysomal polyA+ RNA was 
isolated from tissue according to Cox and Goldberg, 1988). 

30 



(dd) Protein Degradation 
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Arabidopsis thaliana (ecotype Ws) wild-type and 13B 12-1 (homozygous) mutant seed 
were sown in pots containing Metro-mix 350 soil and incubated at 4°C for four days. Vernalized 
seeds were germinated in the greenhouse (16 hr light/8 hr dark) over a 7 day period. Mutant 
seedlings were sprayed with 0.02% (active ingredient) Finale to confirm their transgenic 
5 standing. Plants were grown until the mutant phenotype (either multiple pistils in a single flower 
and/or multiple branching per node) was apparent. Young inflorescences immediately forming 
from the multiple-branched stems were cut and flash frozen in liquid nitrogen. Young 
inflorescences from wild-type plants grown in parallel and under identical conditions were 
collected as controls. All collected tissue was stored at -80°C until RNA isolation. (EXPT REP 
10 108451) 

i,gj (ee) Root tips 

W 

Seeds of Arabidopsis thaliana (ecotye Ws) were placed on MS plates and vernalized at 

y * 

ITU 4°C for 3 days before being placed in a 25°C growth chamber having 16 hr light/8 hr dark, 70% 

relative humidty and about 3 W/m 2 . After 6 days, young seedlings were transferred to flasks 
S containing B5 liquid medium, 1% sucrose and 0.05 mg/1 indole-3-butyric acid. Flasks were 
fpy incubated at room temperature with 100 rpm agitation. Media was replaced weekly. After three 
:S weeks, roots were harvested and incubated for 1 hr with 2% pectinase, 0.2% cellulase, pH 7 
^ before straining through a #80 (Sigma) sieve. The root body material remaining on the sieve 
20 (used as the control) was flash frozen and stored at -80°C until use. The material that passed 

through the #80 sieve was strained through a #200 (Sigma) sieve and the material remaining on 
the sieve (root tips) was flash frozen and stored at -80°C until use. Approximately 10 mg of root 
tips were collected from one flask of root culture. 

Seeds of maize hybrid 35 A (Pioneer) were sown in water-moistened sand in flats (10 
25 rows, 5-6 seed/row) and covered with clear, plastic lids before being placed in a growth chamber 
having 16 hr light (25°C)/8 hr dark (20°C), 75% relative humidity and 13,000-14,000 LUX. 
Covered flats were watered every three days for 8 days. Seedlings were carefully removed from 
the sand and the root tips (-2 mm long) were removed and flash frozen in liquid nitrogen prior to 
storage at -80°C. The tissues above the root tips (-1 cm long) were cut, treated as above and 
30 used as control tissue. 
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(ff) rtl 

The rtl allele is a variation of rtl rootless 1 and is recessive. Plants displaying the rtl 
phenotype have few or no secondary roots. 
5 Seed from plants segregating for rtl were sown on sand and placed in a growth chamber 

having 16 hr light/8 hr dark, 13,000 LUX, 70% humidity and 20°C temperature. Plants were 
watered every three days with tap water. Eleven (11) day old seedlings were carefully removed 
from the sand, keeping the roots intact. rf7-type seedlings were separated from their wild-type 
counterparts and the root tissue isolated. Root tissue from normal seedlings (control) and rtl 
10 mutants were flash frozen in liquid nitrogen and stored at -80°C until use. 

J (gg) Imbibed seed 

5jj Seeds of maize hybrid 35 A (Pioneer) were sown in water-moistened sand in covered flats 

Ql (10 rows, 5-6 seed/row) and covered with clear, plastic lids before being placed in a growth 
i£ chamber having 16 hr light (25°C)/8 hr dark (20°C), 75% relative humidity and 1 3,000-14,000 
^ LUX. One day after sowing, whole seeds were flash frozen in liquid nitrogen prior to storage at - 
03 80°C. Two days after sowing, embryos and endosperm were isolated and flash frozen in liquid 
\t nitrogen prior to storage at -80°C. On days 3-6, aerial tissues, roots and endosperm were isolated 
G and flash frozen in liquid nitrogen prior to storage at -80°C. 
20 

(hh) Rough Sheath2-R (rs2-R) Mutants (1400-6/S-17) 

This experiment was conducted to identify abnormally expressed genes in the shoot apex of 
rough sheath2-R (rs2-R) mutant plants. rs2 encodes a myb domain DNA binding protein that 
functions in repression of several shoot apical meristem expressed homeobox genes. Two 
25 homeobox gene targets are known for rs2 repression, rough sheath 1, liguleless J. The recessive 
loss of function phenotype of rs2-R homozygous plants is described in Schneeberger et ah 1998 
Development 125: 2857-2865. 

The seed stock genetically segregates 1:1 for rs2-R/rs2-R : rs2-R/+ 

Preparation of tissue samples: 160 seedlings pooled from 2 and 3 week old plants grown in 
30 sand. Growth conditions; Conviron #107 @ 12 hr days/12hr night, 25°C, 75% humidity. Shoot 
apex was dissected to include leaf three and older. (Pictures available upon request). 
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1) rough sheath2-R homozygous (mutant) shoot apex 

2) rough sheath2-R heterozygous (wt 5 control) shoot apex 

(ii) Leaf Mutant 3642: 

5 Mutant 3642 is a recessive mutation that causes abnormal leaf development. The leaves 

of mutant 3642 plants are characterized by leaf twisting and irregular leaf shape. Mutant 3642 
plants also exhibit abnormally shaped floral organs which results in reduced fertility. 

Seed segregating for the mutant phenotype was sown in Metro-mix 350 soil and 
grown in a Conviron growth chamber with watering by sub-irrigation twice a week. 
10 Environmental conditions were set at 20 degrees Celsius, 70% humidity with an 8 hour 



temperature under fluorescent lighting until flowering. Flowers were harvested after 28 days in 



20 three different categories. Buds that had not opened at all and were completely green were 

categorized as "flower buds" (also referred to as green buds by the investigator). Buds that had 
started to open, with white petals emerging slightly were categorized as "green flowers" (also 
referred to as white buds by the investigator). Flowers that had opened mostly (with no silique 
elongation) with white petals completely visible were categorized as "white flowers" (also 

25 referred to as open flowers by the investigator). Buds and flowers were harvested with forceps, 
flash frozen in liquid nitrogen and stored at -80C until RNA was isolated. 

2. Microarray Hybridization Procedures 

30 Microarray technology provides the ability to monitor mRNA transcript levels of 

thousands of genes in a single experiment. These experiments simultaneously hybridize two 




day, 16 hour night light regime. Plants were harvested after 4 weeks of growth and the 
entire aerial portion of the plant was harvested and immediately frozen in liquid nitrogen 
and stored at -80C. Mutant phenotype plants were harvested separately from normal 
phenotype plants, which serve as the control tissue. 




(jj) Flowers (green, white or buds) 

Approximately 10 \x\ of Arabidopsis thaliana seeds (ecotype Ws) were sown on 350 soil 
(containing 0.03% marathon) and vernalized at 4C for 3 days. Plants were then grown at room 
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differentially labeled fluorescent cDNA pools to glass slides that have been previously spotted 
with cDNA clones of the same species. Each arrayed cDNA spot will have a corresponding ratio 
of fluorescence that represents the level of disparity between the respective mRNA species in the 
two sample pools. Thousands of polynucleotides can be spotted on one slide, and each 
5 experiment generates a global expression pattern. 

COATING SLIDES 

The microarray consists of a chemically coated microscope slide, referred herein as a "chip" with 
numerous polynucleotide samples arrayed at a high density. The poly-L-lysine coating allows 
10 for this spotting at high density by providing a hydrophobic surface, reducing the spreading of 
Q spots of DNA solution arrayed on the slides. Glass microscope slides (Gold Seal #3010 
Jfj manufactured by Gold Seal Products, Portsmouth, New Hampshire, USA) were coated with a 
Jj*j 0.1 %W/V solution of Poly-L-lysine (Sigma, St. Louis, Missouri) using the following protocol: 

m 
m 

%1p Slides were placed in slide racks (Shandon Lipshaw #121). The racks were then put in chambers 
(Shandon Lipshaw #121). 

$5 Cleaning solution was prepared: 

m 

t 70 g NaOH was dissolved in 280 mL ddH20. 

p 420 mL 95% ethanol was added. The total volume was 700 mL (= 2 X 350 mL); it was stirred 
20 until completely mixed. 

If the solution remained cloudy, ddHiO was added until clear. 

3. The solution was poured into chambers with slides; the chambers were covered with glass 
lids. The solution was mixed on an orbital shaker for 2 hr. 

4. The racks were quickly transferred to fresh chambers filled with ddLbO. They were rinsed 
25 vigorously by plunging racks up and down. 

Rinses were repeated 4X with fresh ddH 2 0 each time, to remove all traces of NaOH-ethanol. 

5. Poly lysine solution was prepared: 

70 mL poly-L-lysine + 70 mL tissue culture PBS in 560 mL water, using plastic graduated 
cylinder and beaker. 
30 6. Slides were transferred to polylysine solution and shaken for 1 hr. 



785 



PATENT 

Attorney Docket No. 2750-1481P 

7. The rack was transferred to a fresh chambers filled with ddH 2 0. It was plunged up and 
down 5X to rinse. 

8. The slides were centrifuged on microtiter plate carriers (paper towels were placed below the 
rack to absorb liquid) for 5 min. @ 500 rpm. The slide racks were transferred to empty 
chambers with covers. 

9. Slide racks were dried in a 45C oven for 10 min. 

10. The slides were stored in a closed plastic slide box. 

1 1 . Normally, the surface of lysine coated slides was not very hydrophobic immediately after this 
process, but became increasingly hydrophobic with storage. A hydrophobic surface helped 
ensure that spots didn't run together while printing at high densities. After they aged for 10 days 
to a month the slides were ready to use. However, coated slides that have been sitting around for 
long periods of time were usually too old to be used. This was because they developed opaque 
patches, visible when held to the light, and these resulted in high background hybridization from 
the fluorescent probe. 

Alternativey, precoated glass slides were purchased from TeleChem Internation, Inc. 
(Sunnyvale, CA, 94089; catalog number SMM-25, Superamine substrates). 

PCR AMPLIFICATION OF CDNA CLONE INSERTS 
Polynucleotides were amplified from Arabidopsis cDNA clones using insert specific probes. 
The resulting lOOuL PCR reactions were purified with Qiaquick 96 PCR purification columns 
(Qiagen, Valencia, California, USA) and eluted in 30 uL of 5mM Tris. 8.5uL of the elution were 
mixed with 1.5uL of 20X SSC to give a final spotting solution of DNA in 3X SSC. The 
concentrations of DNA generated from each clone varied between 10-100 ng/ul, but were usually 
about 50 ng/ul. 

ARRA YING OF PCR PRODUCTS ON GLASS SLIDES 

PCR products from cDNA clones were spotted onto the poly-L-Lysine coated glass slides using 
an arrangement of quill-tip pins (ChipMaker 3 spotting pins; Telechem, International, Inc., 
Sunnyvale, California, USA) and a robotic arrayer (PixSys 3500, Cartesian Technologies, Irvine, 
California, USA). Around 0.5 nl of a prepared PCR product was spotted at each location to 
produce spots with approximately lOOum diameters. Spot center-to-center spacing was from 180 
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um to 210um depending on the array. Printing was conducted in a chamber with relative 
humidity set at 50%. 

Slides containing maize sequences were purchased from Agilent Technology (Palo Alto, 
CA 94304). 

5 

POST-PROCESSING OF SLIDES 

After arraying, slides were processed through a series of steps - rehydration, UV cross-linking, 
blocking and denaturation - required prior to hybridization. Slides were rehydrated by placing 
them over a beaker of warm water (DNA face down), for 2-3 sec, to distribute the DNA more 
10 evenly within the spots, and then snap dried on a hot plate (DNA side, face up). The DNA was 
P then cross-linked to the slides by UV irradiation (60-65mJ; 2400 Stratalinker, Stratagene, La 
Jolla, California, USA). 

w 

U1 

fjl Following this a blocking step was performed to modify remaining free lysine groups, and hence 
ijjj minimize their ability to bind labeled probe DNA. To achieve this the arrays were placed in a 

slide rack. An empty slide chamber was left ready on an orbital shaker. The rack was bent 
!jg slightly inwards in the middle, to ensure the slides would not run into each other while shaking. 

The blocking solution was prepared as follows: 
O 3x 350-ml glass chambers (with metal tops) were set to one side, and a large round Pyrex dish 
20 with dH 2 0 was placed ready in the microwave. At this time, 15ml sodium borate was prepared 

in a 50 ml conical tube. 

6-g succinic anhydride was dissolved in approx. 325-350 mL l-methyl-2-pyrrolidinone. Rapid 
addition of reagent was crucial. 

a. Immediately after the last flake of the succinic anhydride dissolved, the 15-mL sodium borate 
25 was added. 

b. Immediately after the sodium borate solution mixed in, the solution was poured into an empty 
slide chamber. 

c. The slide rack was plunged rapidly and evenly in the solution. It was vigorously shaken up and 
down for a few seconds, making sure slides never left the solution. 

30 d. It was mixed on an orbital shaker for 15-20 min. Meanwhile, the water in the Pyrex dish 
(enough to cover slide rack) was heated to boiling. 
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Following this, the slide rack was gently plunge in the 95C water (just stopped boiling) for 2 
min. Then the slide rack was plunged 5X in 95% ethanol. The slides and rack were centrifuged 
for 5 min. @ 500 rpm. The slides were loaded quickly and evenly onto the carriers to avoid 
streaking. The arrays were used immediately or store in slide box. 

5 

The Hybridization process began with the isolation of mRNA from the two tissues (see 
"Isolation of total RNA " and "Isolation of mRNA below) in question followed by their 
conversion to single stranded cDNA (see "Generation of probes for hybridization " below). The 
cDNA from each tissue was independently labeled with a different fluorescent dye and then both 
10 samples were pooled together. This final differentially labeled cDNA pool was then placed on a 
Q processed microarray and allowed to hybridize (see "Hybridization and wash conditions " 
^ below). 



ffi transferred into a 50-ml centrifuge tube containing 10 ml of Trizol reagent. The tube was 
£ vigorously vortexed for 1 min and then incubated at room temperature for 10-20 min. on an 

Q orbital shaker at 220 rpm. Two ml of chloroform was added to the tube and the solution 

20 vortexed vigorously for at least 30-sec before again incubating at room temperature with 

shaking. The sample was then centrifuged at 12,000 X g (10,000 rpm) for 15-20 min at 4°C. The 
aqueous layer was removed and mixed by inversion with 2.5 ml of 1.2 M NaCl/0.8 M Sodium 
Citrate and 2.5 ml of isopropyl alcohol added. After a 10 min. incubation at room temperature, 
the sample was centrifuged at 12,000 X g (10,000 rpm) for 15 min at 4°C. The pellet was 

25 washed with 70% ethanol, re-centrifuged at 8,000 rpm for 5 min and then air dried at room 

temperature for 10 min. The resulting total RNA was dissolved in either TE (10 mM Tris-HCl, 1 
mM EDTA, pH 8.0) or DEPC (diethylpyrocarbonate) treated deionized water (RNAse-free 
water). For subsequent isolation of mRNA using the Qiagen kit, the total RNA pellet was 
dissolved in RNAse-free water. 



ISOLATION OF TOTAL RNA 



Approximately 1 g of plant tissue was ground in liquid nitrogen to a fine powder and 



30 



ISOLA TION OF mRNA 
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mRNA was isolated using the Qiagen Oligotex mRNA Spin-Column protocol (Qiagen, 
Valencia,California). Briefly, 500 ul OBB buffer (20 mM Tris-Cl, pH 7.5, 1 M NaCl, 2 mM 
EDTA, 0.2% SDS) was added to 500 ul of total RNA (0.5 - 0.75 mg) and mixed thoroughly. 
The sample was first incubated at 70°C for 3 min, then at room temperature for 10 minutes and 
finally centrifuged for 2 min at 14,000 - 18,000 X g. The pellet was resuspended in 400 ul OW2 
buffer (10 mM Tris-Cl, pH 7.5, 150 mM NaCl, 1 mM EDTA) by vortexing, the resulting 
solution placed on a small spin column in a 1.5 ml RNase-free microcentrifuge tube and 
centrifuged for 1 min at 14,000 - 1 8,000 X g. The spin column was transferred to a new 1 .5 ml 
RNase-free microcentrifuge tube and washed with 400 ul of OW2 buffer. To release the isolated 
mRNA from the resin, the spin column was again transferred to a new RNase-free 1 .5 ml 
microcentrifuge tube, 20-100 ul 70°C OEB buffer (5 mM Tris-Cl, pH 7.5) added and the resin 
resuspended in the resulting solution via pipeting. The mRNA solution was collected after 
centrifuging for 1 min at 14,000 - 18,000 X g. 

Alternatively, mRNA was isolated using the Stratagene Poly(A) Quik mRNA Isolation 
Kit (Startagene, La Jolla, California). Here, up to 0.5 mg of total RNA (maximum volume of 1 
ml) was incubated at 65°C for 5 minutes, snap cooled on ice and 0.1 X volumes of 10X sample 
buffer (lOmM Tris-HCl (pH 7.5), 1 mM EDTA (pH 8.0) 5 M NaCl) added. The RNA sample 
was applied to a prepared push column and passed through the column at a rate of ~1 drop every 
2 sec. The solution collected was reapplied to the column and collected as above. 200 ul of high 
salt buffer (10 mM Tris-HCl (pH 7.5), 1 mM EDTA, 0.5 NaCl) was applied to the column and 
passed through the column at a rate of ~1 drop every 2 sec. This step was repeated and followed 
by three low salt buffer (10 mM Tris-HCl (pH 7.5), 1 mM EDTA, 0. 1 M NaCl) washes 
preformed in a similar manner. mRNA was eluted by applying to the column four separate 200 
ul aliquots of elution buffer (10 mM Tris-HCl (pH 7.5), 1 mM EDTA) preheated to 65°C. Here, 
the elution buffer was passed through the column at a rate of 1 drop/sec. The resulting mRNA 
solution was precipitated by adding 0.1X volumes of 10X sample buffer, 2,5 volumes of ice-cold 
100% ethanol, incubating overnight at -20°C and centrifuging at 14,000-18,000 X g for 20-30 
min at 4°C. The pellet was washed with 70% ethanol and air dried for 10 min. at room 
temperature before resuspension in RNase-free deionized water. 

PREPARATION OF YEAST CONTROLS 
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Plasmid DNA was isolated from the following yeast clones using Qiagen filtered 
maxiprep kits (Qiagen, Valencia, California): YAL022c(Fun26), YAL031c(Fun21), YBR032w, 
YDL131w, YDL182w, YDL194w, YDL196w, YDR050c and YDR1 16c. Plasmid DNA was 
linearized with either BsrBl (YAL022c(Fun26), YAL031c(Fun21), YDL131w, YDL182w, 
5 YDL1 94w, YDL 1 96w, YDR050c) or Afllll (YBR032w, YDR1 1 6c) and isolated. 

In Vitro Transcription of Yeast Clones 

The following solution was incubated at 37°C for 2 hours: 17 ul of isolated yeast insert 
DNA (1 ug), 20 ul 5X buffer, 10 ul 100 mM DTT, 2.5 ul (100 U) RNasin, 20 ul 2.5 mM (ea.) 
10 rNTPs, 2.7 ul (40U) SP6 polymerase and 27.8 ul RNase-free deionized water. 2 ul (2 U) Ampli 

y DNase I was added and the incubation continued for another 15 min. 10 [il 5M NH4OAC and 
l -Q 100 jil phenol:chloroform:isoamyl alcohol (25:24:1) were added, the solution vortexed and then 
01 centrifliged to separate the phases. To precipitate the RNA, 250 \i\ ethanol was added and the 
P|i solution incubated at -20 C for at least one hour. The sample was then centrifuged for 20 min at 
t& 4°C at 14,000-18,000 X g, the pellet washed with 500 |al of 70% ethanol, air dried at room 

C3 temperature for 1 0 min and resuspended in 1 00 |il of RNase-free deionized water. The 

S3 

ify precipitation procedure was then repeated. 

Alternatively, after the two-hour incubation, the solution was extracted with 
IM* phenol/chloroform once before adding 0.1 volume 3M sodium acetate and 2.5 volumes of 100% 
20 ethanol. The solution was centrifuged at 15,000rpm, 4°C for 20 minutes and the pellet 

resuspended in RNase-free deionized water. The DNase I treatment was carried out at 37°C for 
30 minutes using 2 U of Ampli DNase I in the following reaction condition: 50 mM Tris-HCl 
(pH 7.5), 10 mM MgCb . The DNase I reaction was then stopped with the addition of NH4OAC 
and phenol: chloroform :isoamyl alcohol (25:24:1), and RNA isolated as described above. 
25 0. 1 5-2.5 ng of the in vitro transcript RNA from each yeast clone were added to each plant 

mRNA sample prior to labeling to serve as positive (internal) probe controls. 

GENERATION OF PROBES FOR HYBRIDIZATION 
Generation of labeled probes for hybridization from first-strand cDNA 
30 Hybridization probes were generated from isolated mRNA using an Atlas™ Glass 

Fluorescent Labeling Kit (Clontech Laboratories, Inc., Palo Alto, California, USA). This entails 
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a two step labeling procedure that first incorporates primary aliphatic amino groups during 
cDNA synthesis and then couples fluorescent dye to the cDNA by reaction with the amino 
functional groups. Briefly, 5 pg of oligo(dT)i 8 primer d(TTTTTTTTTTTTTTTTTTV) was 
mixed with Poly A+ mRNA (1 .5 - 2 pg mRNA isolated using the Qiagen Oligotex mRNA Spin- 
5 Column protocol or-the Stratagene Poly(A) Quik mRNA Isolation protocol (Stratagene, La Jolla, 
California, USA)) in a total volume of 25 pi. The sample was incubated in a thermocycler at 
70°C for 5 min, cooled to 48°C and 10 pi of 5X cDNA Synthesis Buffer (kit supplied), 5 pi 10X 
dNTP mix (dATP, dCTP, dGTP, dTTP and aminoallyl-dUTP; kit supplied), 7.5 pi deionized 
water and 2.5 pi MMLV Reverse Transcriptase (500U) added. The reaction was then incubated 
10 at 48°C for 30 minutes, followed by lhr incubation at 42°C. At the end of the incubation the 
:=fj reaction was heated to 70°C for 10 min, cooled to 37°C and 0.5 pi (5 U) RNase H added, before 
incubating for 1 5 min at 37°C. The solution was vortexed for 1 min after the addition of 0.5 pi 
?{! 0.5 M EDTA and 5 pi of QuickClean Resin (kit supplied) then centrifuged at 14,000-18,000 X g 
irU for 1 min. After removing the supernatant to a 0.45 pm spin filter (kit supplied), the sample was 
15 again centrifuged at 14,000-18,000 X g for 1 min, and 5.5 pi 3 M sodium acetate and 137.5 pi of 

100% ethanol added to the sample before incubating at -20°C for at least 1 hr. The sample was 
W then centrifuged at 14,000-1 8,000 X g at 4°C for 20 min, the resulting pellet washed with 500 pi 
ip 70% ethanol, air-dried at room temperature for 10 min and resuspended in 10 pi of 2X 

fluorescent labeling buffer (kit provided). 10 pi each of the fluorescent dyes Cy3 and Cy5 
20 (Amersham Pharmacia (Piscataway, New Jersey, USA); prepared according to Atlas™ kit 

directions of Clontech) were added and the sample incubated in the dark at room temperature for 
30 min. 

The fluorescently labeled first strand cDNA was precipitated by adding 2 pi 3M sodium 
acetate and 50 pi 100% ethanol, incubated at -20°C for at least 2 hrs, centrifuged at 14,000- 
25 18,000 X g for 20 min, washed with 70% ethanol, air-dried for 10 min and dissolved in 100 pi 
of water. 

Alternatively, 3-4 pg mRNA, 2.5 (-8.9 ng of in vitro translated mRNA) pi yeast control 
and 3 pg oligo dTV (TTTTTTTTTTTTTTTTTT( A/C/G) ; Sequence ID No.: X) were mixed in a 
total volume of 24.7 pi. The sample was incubated in a thermocycler at 70°C for 10 min. before 
30 chilling on ice. To this, 8 pi of 5X first strand buffer ( Superscript II RNase H- Reverse 
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Transcriptase kit from Invitrogen (Carlsbad, California 92008); cat no. 18064022), 0.8 °C of aa- 
dUTP/dNTP mix (SOX; 25mM dATP, 25mM dGTP, 25mM dCTP, 15mM dTTP, lOmM 
aminoallyl-dUTP), 4 jal of 0.1 M DTT and 2.5 jal (500 units) of Superscript R.T.II enzyme 
(Stratagene) were added. The sample was incubated at 42°C for 2 hours before a mixture of 10 
5 °C of 1M NaOH and 10°C of 0.5 M EDTA were added. After a 15 minute incubation at 65°C, 25 
\x\ of 1 M Tris pH 7.4 was added. This was mixed with 450 \il of water in a Microcon 30 column 
before centrifugation at 1 1,000 X g for 12 min. The column was washed twice with 450 jj! 
(centrifugation at 1 1,000 g, 12 min.) before eluting the sample by inverting the Microcon column 
and centrifiiging at 1 1,000 X g for 20 seconds. Sample was dehydrated by centrifugation under 
10 vacuum and stored at -20°C. 
^ Each reaction pellet was disolved in 9 jul of 0.1 M carbonate buffer (0.1M sodium 

;^ carbonate and sodium bicarbonate, pH=8.5-9) and 4.5 jil of this placed in two microfuge tubes. 

w 

U] 4.5 jlxI of each dye (in DMSO) were added and the mixture incubated in the dark for 1 hour. 4.5 

m 

jflj jil of 4 M hydroxylamine was added and again incubated in the dark for 15 minutes. 
1 y 1 Regardless of the method used for probe generation, the probe was purified using a 

: S Qiagen PCR cleanup kit (Qiagen, Valencia, California, USA), and eluted with 100 ul EB (kit 

m 

flij provided). The sample was loaded on a Microcon YM-30 (Millipore, Bedford, Massachusetts, 
% USA) spin column and concentrated to 4-5 ul in volume. Probes for the maize microarrays 

were generated using the Fluorescent Linear Amplification Kit (cat. No. G2556A) from Agilent 
20 Technologies (Palo Alto, CA). 

HYBRIDIZATION AND WASH CONDITIONS 

The following Hybridization and Washing Condition were developed: 

25 Hybridization Conditions : 

Labeled probe was heated at 95°C for 3 min and chilled on ice. Then 25 DL of the 
hybridization buffer which was warmed at 42C was added to the probe, mixing by pipetting, to 
give a final concentration of: 

30 



50% formamide 



792 




PATENT 

Attorney Docket No. 2750-1481P 

4x SSC 
0.03% SDS 
5x Denhardt's solution 

0. 1 |ig/ml single-stranded salmon sperm DNA 

The probe was kept at 42C. Prior to the hybridization, the probe was heated for 1 more 
min., added to the array, and then covered with a glass cover slip. Slides were placed in 
hybridization chambers (Telechem, Sunnyvale, California) and incubated at 42°C overnight. 



10 Washing Conditions : 



m 
su 



20 



A. Slides were washed in Ix SSC + 0.03% SDS solution at room temperature for 5 minutes, 



iljl B. Slides were washed in 0.2x SSC at room temperature for 5 minutes, 
f 1 C. Slides were washed in 0.05x SSC at room temperature for 5 minutes. 

iiS 



After A, B, and C, slides were spun at 800 x g for 2 min. to dry. They were then scanned. 

Maize microarrays were hybridized according to the instructions included Fluorescent 
Linear Amplification Kit (cat. No. G2556A) from Agilent Technologies (Palo Alto, CA). 

SCANNING OF SLIDES 



The chips were scanned using a ScanArray 3000 or 5000 (General Scanning, Watertown, 
Massachusetts, USA). The chips were scanned at 543 and 633nm, at 10 um resolution to 
measure the intensity of the two fluorescent dyes incorporated into the samples hybridized to the 

25 chips. 

DATA EXTRACTION AND ANALYSIS 

The images generated by scanning slides consisted of two 16-bit TIFF images 
representing the fluorescent emissions of the two samples at each arrayed spot. These images 
were then quantified and processed for expression analysis using the data extraction software 

30 Imagene ™ (Biodiscovery, Los Angeles, California, USA). Imagene output was subsequently 
analyzed using the analysis program Genespring ™ (Silicon Genetics, San Carlos, California, 
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USA). In Genespring, the data was imported using median pixel intensity measurements derived 
from Imagene output. Background subtraction, ratio calculation and normalization were all 
conducted in Genespring. Normalization was achieved by breaking the data in to 32 groups, 
each of which represented one of the 32 pin printing regions on the microarray. Groups consist 

5 of 360 to 550 spots. Each group was independently normalized by setting the median of ratios to 

^ one and multiplying ratios by the appropriate factor. 

EXAMPLE 4: AFLP EXPERIMENTS AND RESULTS 

10 Production of Samples 
j M mRNA was prepared from 27 plant tissues. Based on preliminary cDNA-AFLP analysis 

hQ with a few primer combinations, 1 1 plant tissues and/or pooled samples were selected. Samples 

JJj were selected to give the greatest representation of unique band upon electrophoresis. The final 

'§\ 11 samples or pooled samples used in the cDNA-AFLP analysis were: 

m 

Jwi S 1 Dark adapted seedlings 

S2 Roots/Etiolated Seedlings 



:; p S3 Mature leaves, soil grown 

S4 Immature buds, inflorescence meristem 

20 S5 Flowers opened 

56 Siliques, all stages 

57 Senescing leaves (just beginning to yellow) 

58 Callus Inducing medium 
Callus shoot induction 

25 Callus root induction 

59 Wounding 

Methy 1-j asmonate-treated 
S10 Oxidative stress 

Drought stress 
30 Oxygen Stress-flooding 

SI 1 Heat treated light grown seedling 
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Cold treated light grown seedlings 

cDNA from each of the 1 1 samples was digested with two restriction endonucleases, 
namely Taql and Msel. Taql and Msel adapters were then ligated to the restriction enzyme 
5 fragments. Using primers to these adapters that were specific in sequence (i.e. without 
extensions), the restriction fragments were subjected to cycles of non-radioactive pre- 
amplification. 

Selective PCR 

10 In order to limit the number of fragments or bands on each lane of the AFLP gel, 

; m fragments were subjected to another round of selective radioactive polymerase chain 
hQ amplification. The Taql primers used in this amplification were 5 '-labelled with P . For these 
H p amplifications, the Taql primers had two extra nucleotides at their 3 ' end and the Msel primers 
; had three extra nucleotides at their 3' end. This resulted in 16 primer designs for the Taql primer 
1§1 and 64 primer designs for the Msel primer. Altogether, this gave rise to a total of 1024 primer 
^ designs. Fragments generated in this selective amplification protocol were run with labeled 

molecular weight markers on polyacrylamide gels to separate fragments in the size range of 100 

try 

Si j£ - 600 nucleotides. 

'p: Following gel electrophoresis, profiles were analyzed with a phosphoimager. From these 
20 images, electronic files, giving the mobilities of all bands on the gels and their intensities in each 
of the samples, were compiled. 

All unique bands were cut out of the gels. The gel pieces were placed in 96 well plates 
for elution and their plate designation was linked to their electrophoretic mobilities recorded in 
the electronic files. The eluted fragments were then subjected to another round of amplification, 
25 this time using reamplification primers (see below). After amplification, DNA fragments were 
sequenced. 

A computer database was established linking the mobilities of all the bands observed on 
the cDNA-AFLP gels with the sequence of the correspondingly isolated fragment. The sequence 
allowed for identification of the gene from which the cDNA-AFLP fragment was derived, 
30 allowing for a linkage of band mobility with the transcript of a specific gene. Also linked to the 



795 




PATENT 

Attorney Docket No. 2750-1481P 
band mobilities were their intensities recorded for each of the eleven samples used in 
constructing the database. 

This cDNA-AFLP analysis with TaqllMsel and 1024 primer combinations was repeated using 
the enzymes Nlalll in place of Taql, and Csp6I in place of Msel. 

5 

Using the Database for the Transcript Profiling of Experimental Samples 

Experimental Samples were subjected to cDNA-AFLP as described above, resulting in 
electronic files recording band mobilities and intensities. Through use of the database 
established above, band mobilities could be linked to specific cDNAs, and therefore genes. 
10 Furthermore, the linkage with the intensities in the respective samples allowed for the 
O quantification of specific cDNAs in these samples, and thus the relative concentration of specific 
transcripts in the samples, indicating the level to which specific genes were expressed. 

hi 

m 

flT Reamplification primers 

lS 99G24 

^ CGCCAGGGTTTTCCCAGTCACGACp4CG4C70C71gatgagtcctgagtaa| 

$3 Ml 3 forward +10 MseI+0 

III 

p 99G20 

20 AGCGGATAACAATTTCACACAGGA|C4C4CrGGL4|tagactgcgtaccga| 
Ml 3 reverse +10 TaqI +0 

Purification of the Reamplifiction reaction before sequencing 

25 5 ul reamplification reaction 
0,25 ul lOxPCR buffer 

0,33 ul Shrimp Alkaline Phosphatase (Amersham Life Science) 
0,033 ul Exonuclease I (USB) 
0,297 ul SAP dilution buffer 
30 1,59 ul MQ 
7.5 ul total 
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30' 37°C 
10' 80°C 
4°C 

Sample Preparation 

SI : Dark adapted seedlings : Seeds of Arabidopsis thaliana (wassilewskija) were sown in pots 
and left at 4°C for two to three days to vernalize. They were transferred to a growth chamber 
after three days. The intensity of light in the growth chamber was 7000-8000 LUX, temperature 
was 22°C, with 16 h light and 8 h dark. After 8 days, the seedlings were foil-wrapped and 
harvested after two days. 

S2: Roots/Etiolated seedlings: Seeds of Arabidopsis thaliana (wassilewskija) were germinated 
on solid germination media (IX MS salts, IX MS vitamins, 20g/L sucrose, 50 mg/L MES pH 
5.8) in the dark. Tissues were harvested 14 days later. 

S3: Mature leaves, soil grown: Seeds of Arabidopsis thaliana (wassilewskija) were sown in pots 
and left at 4°C for two to three days to vernalize. They were transferred to a growth chamber 
after three days. The intensity of light in the growth chamber was 7000-8000 LUX, temperature 
was 22°C, with 16 h light and 8 h dark. Leaves were harvested 17 days later from plants that had 
not yet bolted. 

S4: Immature buds, inflorescence meristem : Seeds of Arabidopsis thaliana (wassilewskija) were 
sown in pots and left at 4°C for two to three days to vernalize. They were transferred to a growth 
chamber after three days. The intensity of light in the growth chamber was 7000-8000 LUX, 
temperature was 22°C, with 16 h light and 8 h dark. 

S5: Flowers, opened: Seeds of Arabidopsis thaliana (wassilewskija) were sown in pots and left 
at 4°C for two to three days to vernalize. They were transferred to a growth chamber after three 
days. The intensity of light in the growth chamber was 7000-8000 LUX, temperature was 22°C, 
with 1 6 h light and 8 h dark. 
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S6: Siliques, all stages : Seeds of Arabidopsis thaliana (wassilewskija) were sown in pots and left 
at 4°C for two to three days to vernalize. They were transferred to a growth chamber after three 
days. The intensity of light in the growth chamber was 7000-8000 LUX, temperature was 22°C, 
with 16 h light and 8 h dark. 

S7: Senescing leaves (just beginning to yellow): Seeds of Arabidopsis thaliana (wassilewskija) 
were sown in pots and left at 4°C for two to three days to vernalize. They were transferred to a 
growth chamber after three days. The intensity of light in the growth chamber was 7000-8000 
LUX, temperature was 22°C, with 16 h light and 8 h dark. When the plant had leaves that were 
less than 50% yellow, the leaves that were just beginning to yellow were harvested. 

S8: 

Callus Inducing Medium : Seeds of Arabidopsis thaliana (wassilewskija) were surface 
sterilized (1 min-75% Ethanol, 6 min-bleach 100% + Tween 20, rinse) and incubated on MS 
medium containing 2,4-Dichlorophenoxyacetic acid (2,4-D) 1 mg/1 and Kinetin 1 mg/1 in the 
dark for 3 weeks to generate primary callus. 

Hypocotyls and roots of the seedling were swollen after a week after incubation in this callus 
induction medium and subsequently callus was initiated from these swollen areas. 

Callus shoot induction : Primary calluses were transferred to the fresh callus induction 
medium for another 2 weeks growth to generate secondary callus. Secondary callus were 
transferred to shoot induction medium containing MS basal medium and Benzyladenine (BA) 2 
mg/1 and Naphthaleneacetic acid (NAA) ).l mg/1 for 2 weeks growth in the light before it was 
harvested and frozen and sent to Keygene. Many shoot meristems were observed under the 
microscope. 

Callus root induction : Secondary calluses were transferred to root induction medium 
containing MS basal medium, sucrose 1% and Indolebutyric acid (IB A) 0.05 mg/1 in the dark. 
Many root primordia were observed under microscope after 10 days in the root induction 
medium. Those callus tissue were harvested and frozen and sent to Keygene. 

S9: 
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Wounding: Seeds of Arabidopsis thaliana (wassilewskija) were sown in pots and left at 
4°C for two to three days to vernalize. They were transferred to a growth chamber after three 
days. The intensity of light in the growth chamber was 7000-8000 LUX, temperature was 22°C 5 
with 16 h light and 8 h dark. After 20 days, leaves of plants were wounded with pliers. Wounded 
5 leaves were harvested I hour and 4 hours after wounding. 

Methyl jasmonate treatment: Seeds of Arabidopsis thaliana (wassilewskija) were sown in 
pots and left at 4°C for two to three days to vernalize. They were transferred to a growth 
chamber after three days. The intensity of light in the growth chamber was 7000-8000 LUX, 
10 temperature was 22°C, with 16 h light and 8 h dark. After 13 days, plants were sprayed with 
7q 0.001% methyl jasmonate. Leaves were harvested 1 .5 hours and 6 hours after spraying 

Ijl S10: 

4 j Oxidative stress : Seeds of Arabidopsis thaliana (wassilewskija) were sown in pots and 

15^ 1 left at 4°C for two to three days to vernalize. They were transferred to a growth chamber after 
O three days. The intensity of light in the growth chamber was 7000-8000 LUX, temperature was 

22°C, with 16 h light and 8 h dark. After 24 days, a few leaves were inoculated with a mixture of 
jj* 2.5 mM D-glucose, 2.5 U/mL glucose oxidase in 20 mM sodium phosphate buffer pH 6.5. After 
IM« an hour, 3 hours, or 5 hours after inoculation, whole plant, except for the inoculated leaves, was 
20 harvested. This sample was mixed with sample from plants that were sitting in full sun (152,000 
LUX) for 2 hours or four hours. 

Drought stress : Seeds of Arabidopsis thaliana (wassilewskija) were sown in pots and left 
at 4°C for two to three days to vernalize. They were transferred to a growth chamber after three 
25 days. The intensity of light in the growth chamber was 7000-8000 LUX, temperature was 22°C, 
with 16 h light and 8 h dark. After 20 days, aerial tissues were harvested and left to dry in 3MM 
Whatman paper for 1 hour or 4 hours. 

Oxygen stress : Seeds of Arabidopsis thaliana (wassilewskija) were sown in pots and left 

30 at 4°C for two to three days to vernalize. They were transferred to a growth chamber after three 

days. The intensity of light in the growth chamber was 7000-8000 LUX, temperature was 22°C, 
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with 16 h light and 8 h dark. After 21 days, the plant was flooded by immersing its pot in a 
beaker of tap water. After 6 days, the upper tissues were harvested. 

SI 1 : Heat-treated light grown seedlings: Seeds of Arabidopsis thaliana (wassilewskija) were 
5 sown in pots and left at 4°C for two to three days to vernalize. They were transferred to a growth 
chamber after three days. The intensity of light in the growth chamber was 7000-8000 LUX, 
temperature was 22°C, with 16 h light and 8 h dark. Over a 5 hour period, the temperature was 
raised to 42°C at the rate of approximately 4°C per hour. After 1 hour at 42°C, the aerial tissues 
were collected. This sample was mixed with an equal volume of sample that went through a 
10 heat-recovery treatment namely bringing down the temperature to 22°C from 42°C over a 5 hour 
h Q period at the rate of 4°C per hour. 

y 

Cold-treated light grown seedlings: Seeds of Arabidopsis thaliana (wassilewskija) were 
sown in pots and left at 4°C for two to three days to vernalize. They were transferred to a growth 
1 p 1 chamber after three days. The intensity of light in the growth chamber was 7000-8000 LUX, 
W temperature was 22°C, with 16 h light and 8 h dark. After 1 8 days, the plant was transferred to 
I1J 4°C for an hour before the aerial tissues were harvested. This sample was mixed with aerial 
tissues from another plant that was transferred to 4°C for 27 hours before being harvested. 

20 Analysis of Data: 

Intensity : The intensity of the band corresponds to the value in each lane marked SI, S2 etc. 

P- values : The data shows P- values of each of the samples 1-11. P-values are calculated using 
the following formula 2*(l-NORMDIST(ABS(Sx-AVERAGE(of SI to SI 1, not including 
25 Sx))/STDEV(of S 1 to S 1 1 not including Sx),0, 1 ,TRUE)) using Excel functions. 

The equivalent mathematical formula of P-value is as follows: 
f cp(x) dx ? integrated from a to oo, 

where <p(x) is a normal distribution: 
30 where a = |Sx-|li| 
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a(S 1 . . . S 1 1 , not including Sx); 
where ja = is the average of the intensities of all samples except Sx, 
= (ZSl...SnVSx 
n-1 

where a(S 1 . . . S 1 1 , not including Sx) = the standard deviation of all sample intensities 
except Sx. 

Results : 

The results are shown in the MA_diff and/or AFLP_diff AFLP_int and AFLP_diff tables. 

EXAMPLE 5: TRANSFORMATION OF CARROT CELLS 

Transformation of plant cells can be accomplished by a number of methods, as described 
above. Similarly, a number of plant genera can be regenerated from tissue culture following 
transformation. Transformation and regeneration of carrot cells as described herein is 
illustrative. 

Single cell suspension cultures of carrot {Daucus carotd) cells are established from 
hypocotyls of cultivar Early Nantes in B 5 growth medium (O.L. Gamborg et al., Plant Physiol 
45:372 (1970)) plus 2,4-D and 15 mM CaCl 2 (B 5 -44 medium) by methods known in the art. The 
suspension cultures are subcultured by adding 10 ml of the suspension culture to 40 ml of B 5 -44 
medium in 250 ml flasks every 7 days and are maintained in a shaker at 150 rpm at 27 °C in the 
dark. 

The suspension culture cells are transformed with exogenous DNA as described by Z. 
Chen et al. Plant Mol Bio. 36: 163 (1998). Briefly, 4-days post-subculture cells are incubated 
with cell wall digestion solution containing 0.4 M sorbitol, 2% driselase, 5mM MES (2-[N- 
Morpholino] ethanesulfonic acid) pH 5.0 for 5 hours. The digested cells are pelleted gently at 60 
xg for 5 min. and washed twice in W5 solution containing 154 mM NaCl, 5 mM KC1, 125 mM 
CaCl 2 and 5mM glucose, pH 6.0. The protoplasts are suspended in MC solution containing 5 
mM MES, 20 mM CaCl 2? 0.5 M mannitol, pH 5.7 and the protoplast density is adjusted to about 
4 x 10 6 protoplasts per ml. 

15-60 |ag of plasmid DNA is mixed with 0.9 ml of protoplasts. The resulting suspension 
is mixed with 40% polyethylene glycol (MW 8000, PEG 8000), by gentle inversion a few times 
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at room temperature for 5 to 25 min. Protoplast culture medium known in the art is added into 
the PEG-DNA-protoplast mixture. Protoplasts are incubated in the culture medium for 24 hour to 
5 days and cell extracts can be used for assay of transient expression of the introduced gene. 
Alternatively, transformed cells can be used to produce transgenic callus, which in turn can be 
used to produce transgenic plants, by methods known in the art. See, for example, Nomura and 
Komamine, Pit. Phys. 79:988-991 (1985), Identification and Isolation of Single Cells that 
Produce Somatic Embryos in Carrot Suspension Cultures. 

EXAMPLE 6: PHENOTYPE SCREENS AND RESULTS 

A: Triparental Mating and Vacuum Infiltration Transformation of Plants 

Standard laboratory techniques are as described in Sambrook et al. (1989) unless 
otherwise stated. Single colonies of Agrobacterium C58ClRif, E. coli helper strain HB101 and 
the E. coli strain containing the transformation construct to be mobilized into Agrobacterium 
were separately inoculated into appropriate growth media and stationary cultures produced. 100 
[il of each of the three cultures were mixed gently, plated on YEB (5g Gibco beef extract, lg 
Bacto yeast extract, lg Bacto peptone, 5g sucrose, pH 7.4) solid growth media and incubated 
overnight at 28°C. The bacteria from the triparental mating were collected in 2 ml of lambda 
buffer (20 mM Tris (pH 7.5), 100 mM NaCl, 10 mM MgCl 2 ) and serial dilutions made. An 
aliquot of the each dilution was then plated and incubated for 2 days at 28°C on YEB plates 
supplemented with 100 ^ig/ml rifampicin and 100 ^ig/ml carbenicillin for calculation of the 
number of acceptor cells and on YEB plates supplemented with 100 |ig/ml rifampicin, 100 (ag/ml 
carbenicillin and 100 (ig/ml spectinomycin for selection of transconjugant cells. The cointegrate 
structure of purified transconjugants was verified via Southern blot hybridization. 

A transconjugant culture was prepared for vacuum infiltration by inoculating 1 ml of a 
stationary culture arising from a single colony into liquid YEB media and incubating at 28°C for 
approximately 20 hours with shaking (220 rpm) until the OD taken at 600 nm was 0.8-1.0. The 
culture was then pelleted (8000 rpm, 10 min, 4°C in a Sorvall SLA 3000 rotor) and the bacteria 
resuspended in infiltration medium (0.5X MS salts, 5% w/v sucrose, 10 ng/1 BAP, 200 ^1/1 
Silwet L-77, pH 5.8) to a final OD 6 oo of 1 .0. This prepared transconjugant culture was used 
within 20 minutes of preparation. 
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Wild-type plants for vacuum infiltration were grown in 4-inch pots containing Metromix 
200 and Osmocote. Briefly, seeds of Arabidopsis thaliana (ecotype Wassilewskija) were sown 
in pots and left at 4°C for two to four days to vernalize. They were then transferred to 22-25°C 
and grown under long-day (16 hr light: 8 hr dark) conditions, sub-irrigated with water. After 
5 bolting, the primary inflorescence was removed and, after four to eight days, the pots containing 
the plants were inverted in the vacuum chamber to submerge all of the plants in the prepared 
transconjugant culture. Vacuum was drawn for two minutes before pots were removed, covered 
with plastic wrap and incubated in a cool room under darkness or very low light for one to two 
days. The plastic wrap was then removed, the plants returned to their previous growing 
10 conditions and subsequently produced (Tl) seed collected. 

hQ B: Selection of T-DNA Insertion Lines 

W 

m 

;^ Approximately 10,750 seeds from the initial vacuum infiltrated plants were sown per flat 

1§1 of Metromix 350 soil. Flats were vernalized for four to five days at 4°C before being transferred 

% to 

W 22-25°C and grown under long-day (16 hr light: 8 hr dark) conditions, sub-irrigated with 

water. Approximately seven to ten days after germination, the (Tl) seedlings were sprayed with 
0.02% Finale herbicide (AgrEvo). After another five to seven days, herbicide treatment was 
20 repeated. Herbicide resistant Tl plants were allowed to self-pollinate and T2 seed were collected 
from each individual. In the few cases where the Tl plant produced few seed, the T2 seed was 
planted in bulk, the T2 plants allowed to self-pollinate and T3 seed collected. 

C: Phenotype Screening 

25 

Approximately 40 seed from each T2 (or T3) line were planted in a 4-inch pot containing 
either Sunshine mix or Metromix 350 soil. Pots were vernalized for four to five days at 4°C 
before being transferred to 22-25°C and grown under long-day (16 hr light: 8 hr dark) conditions, 
sub-irrigated with water. A first phenotype screen was conducted by visually inspecting the 
30 seedlings five to seven days after germination and aberrant phenotypes noted. Plants were then 
sprayed with Finale herbicide within four days (i.e. about seven to nine days after germination). 
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The second visual screen was conducted on surviving T2 (or T3) plants about sixteen to 
seventeen days after germination and the final screen was conducted after the plants had bolted 
and formed siliques. Here, the third and fourth green siliques were collected and aberrant 
phenotypes noted. The Knock-in and Knock-out Tables contain descriptions of identified 
5 phenotypes. 

Alternative, seed were surface sterilized and transferred to agar solidified medium 
containing Murashige and Skoog salts (IX), 1% sucrose (wt/v) pH 5.7 before autoclaving. Seed 
were cold treated for 48 hours and transferred to long days [16 hours light and 8 hours dark], 25°C. 
Plants were screened at 5 and 10 days. 
10 In another screen, seed were surface sterilized and transferred to agar solidified medium 

containing Murashige and Skoog salts (IX), and combinations of various nitrogen and sucrose 
amounts as specified below:: 
J Medium 1 : no sucrose, 20.6 mM NH4NO3, 1 8.8 mM KN0 3 ; 

jp Medium 2: 0.5% sucrose, 20.6 mM NH4NO3, 1 8.8 mM KN0 3 ; 

1 11 Medium 3: 3% sucrose, 20.6 mM NH4NO3, 1 8.8 mM KN0 3 ; 

J* Medium 4: no sucrose, 20.6 uM NH4NO3, 1 8.8 uM KNO3; 

Medium 5: 0.5% sucrose, 20.6 |iM NH 4 N0 3? 18.8 jaM KN0 3 ; and 
Medium 6: 3% sucrose, 20.6 uM NH 4 N0 33 18.8 |iM KNO3. 

O 

{£ The 0.5% sucrose was the control concentration for the sucrose. The low nitrogene, 20.6 |uM 
20 NH 4 N0 35 1 8.8 \iM KN0 3 , is the control for the nitrogen. Seed were cold treated for 48 hours and 
transferred to long days [16 hours light and 8 hours dark], 25°C. Plants were screened at 2, 5, and 
10 days. 



25 



D: TAIL-PCR and Fragment Sequencing 



Rosette leaves were collected from each putative mutant and crushed between parafilm 
and FTA paper (Life Technologies). Two 2mm 2 hole punches were isolated from each FTA 
sample and washed according to the manufacturer's instructions by vortexing with 200 ul of the 
provided FTA purification reagent. The FTA reagent was removed and the washing procedure 
30 repeated two more times. The sample was then washed twice with 200 ul of FTA TE (10 mM 
Tris, 0.1 mM EDTA, pH 8.0) and vortexing prior to PCR. 
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Primers used for TAIL-PCR are as follows: 

AD2: 5' NGTCGASWGANAWGAA 3' (128-fold degeneracy) 
S = G or C, W = A or T, and N = A, G, C, or T 

5 

LB 1 : 5' GTTTAACTGCGGCTC AACTGTCT 3' 
LB2: 5' CCCATAGACCCTTACCGCTTTAGTT 3' 
10 LB3: 5' GAAAGAAAAAGAGGTATAACTGGTA 3* 

a 

The extent to which the left and right borders of the T-DNA insert were intact was 
measured for each line by PCR. The following components were mixed for PCR: 1 2mm 2 FTA 
Wi sample, 38.75 \d distilled water, 5 \d 10X Platinum PCR buffer (Life Technologies), 2 \d 50 mM 
l;|j MgCl 2 , 1 jal 10 mM dNTPs, 1 |nl 10 primer LB1 (or RBI for analysis of the right border), 1 

^1 10 primer LB3R (or RB3R for analysis of the right border) and 1 .25 U Platinum Taq (Life 
fg Technologies). Cycling conditions were: 94°C, 10 sec; thirty cycles of 94°C, 1 sec. - 54°C, 1 sec. 
p - 72°C, 1 sec; 72°C, 4 sec The expected band size for an intact left border is bp, while an intact 
O right border generates a bp band. 
20 

Fragments containing left or right border T-DNA sequence and adjacent genomic DNA 
sequence were obtained via PCR. First product PCR reactions use the following reaction 
mixture: 1 2mm 2 FTA sample, 12.44 jil distilled water, 2 yd 10X Platinum PCR buffer (Life 
Technologies), 0.6 |al 50 mM MgCl 2 , 0.4 (il 10 mM dNTPs, 0.4 jil 10 |aM primer LB1 (or RBI 

25 for analysis of the right border), 3 jil 20 |^M primer AD2 and 0.8 U Platinum Taq (Life 

Technologies). Cycling conditions for these reactions were: 93°C, 1 min.; 95°C, 1 min.; three 
cycles of 94°C, 45 sec - 62°C, 1 min. - 72°C, 2.5 min.; 94°C,45 sec; 25°C, 3 min.; ramp to 72°C 
in 3 min.; 72°C, 2.5 min.; fourteen cycles of 94°C, 20 sec - 68°C, 1 min. - 72°C, 2.5 min. - 94°C, 
20 sec; - 68°C, 1 min. - 72°C, 2.5 min. - 94°C, 20 sec - 44°C, 1 min. - 72°C, 2.5 min.; 72°C, 5 

30 min.; end; -4.5 hrs. For second product PCR reactions 1 jliI of a 1 :50 dilution of the first PCR 
product reaction was mixed with 13.44 jil distilled water, 2 jal 10X Platinum PCR buffer (Life 
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Technologies), 0.6 [il 50 mM MgCl 2 , 0.4 y,\ 10 mM dNTPs, 0.4 [i\ 10 (iM primer LB2 (or RB2 
for analysis of the right border), 2 ^1 20 jxM primer AD2 and 0.8 U Platinum Taq (Life 
Technologies). Second product cycling conditions were: eleven cycles of 94°C, 20 sec. - 64°C, 1 
min. - 72°C ? 2.5 min. - 94°C, 20 sec. - 64°C, 1 min. - 72°C, 2.5 min. - 94°C, 20 sec. - 44°C, 1 
min.; 72°C, 5 min.; end; ~3 hrs. Third product PCR reactions were prepared by first diluting 2 ^1 
of the second PCR product with 98 jil of distilled water and then adding 1 |^1 of the dilution to 
13.44 |il distilled water, 2 jil 10X Platinum PCR buffer (Life Technologies), 0.6 jal 50 mM 
MgCl 2 , 0.4 [il 10 mM dNTPs, 0.4 jal 10 [iM primer LB3 (or RB3 for analysis of the right border), 
2 jil 20 (iM primer AD2 and 0.8 U Platinum Taq (Life Technologies). Third product cycling 
conditions were: twenty cycles of 94°C, 38 sec. - 44°C, 1 min. - 72°C, 2.5 min.; 72°C, 5 min.; 
end; -2 hrs. Aliquots of the first, second and third PCR products were electrophoresed on 1 % 
TAE (40 mM Tris-acetate, 1 mM EDTA) to determine their size. 

Reactions were purified prior to sequencing by conducting a final PCR reaction. Here, 
0.25 |al Platinum PCR Buffer (Life Technologies), 0.1 \d 50 mM MgCl 2 , 3.3 U SAP shrimp 
alkaline phosphatase, 0.33 U Exonuclease and 1.781 (il distilled water were added to a 5 |xl third 
product and the reaction cycled at 37°C, 30 min.; 80°C, 10 min.; 4°C indefinitely. 

Di-deoxy "Big Dye" sequencing was conducted on Perkin-Elmer 3700 or 377 machines. 

KNOCK-IN EXPERIMENTS 

For the following examples, a two-component system was constructed in a plant to 
ectopically express the desired cDNA. 

First, a plant was generated by inserting a sequence encoding a transcriptional activator 
downstream of a desired promoter, thereby creating a first component where the desired 
promoter facilitates expression of the activator generated a plant. The first component also is 
referred to as the activator line. 

Next, the second component is constructed by linking a desired cDNA to a sequence that 
the transcriptional activator can bind to and facilitate expression of the desired cDNA. The 
second component can be inserted into the activator line by transformation. Alternatively, the 
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second component can be inserted into a separate plant, also referred to as the target line. Then, 
the target and activator lines can be crossed to generate progeny that have both components. 



Two component lines were generated by both means. 

5 

Part I - From crosses 

Target lines containing cDNA constructs are generated using the Agrobacterium-mediated 
transformation. Selected target lines are genetically crossed to activation lines (or promoter 
1 0 lines). Generally, the promoter lines used are as described above. Evaluation of phenotypes is 
O done on the resulting Fl progenies. 



Part II - From Type I Supertransformation 

m 



Promoter activation lines (generally Vascular/Ovule/Young Seed/Embryo line, 



33 Seed/Epidermis/Ovary/Fruit line, Roots/Shoots/Ovule line, and Vasculature/Meristem are 
U 

I* transformed with cDNA constructs using the Agrobacterium mediated transformation. Selected 



;M transformants (and their progenies) are evaluated for changes in phenotypes. The table for the 
20 knock-in of the Type I supertransformation comprises the following information 

Clone ID, 
Pfam, 
Gemini ID 

Trans. Unique ID (which indicates what promoter activation line was 
25 transformed 

S Ratio: segregation ratio after the transformed plants are selected for the 

marker. 

Assay 

Stage: phenotype was observed 
30 - Feature: Where the phenotype was observed 

Phenotype 
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P Ratio: phenotype ratio 
Comments 



5 Part III - From Type II Supertransformation 

Target lines generated using the procedure mentioned in Part I are transformed with T-DNA 
construct containing constitutive promoter. Selected transformants (and their progenies) are 
evaluated for changes in phenotypes. 

10 

-13 An additional deposit of an E. coli Library, E. co//LibA021800, was made at the 

American Type Culture Collection in Manassas, Virginia, USA on February 22, 2000 to meet the 
;9 ; requirements of Budapest Treaty for the international recognition of the deposit of 
llgl microorganisms. This deposit was assigned ATCC accession no. PTA-141 1 . 
J*. Additionaly, ATCC Library deposits; PTA-1 161, PTA-141 1 and PTA-2007 were made at the 
03 American Type Culture Collection in Manassas, Virginia, USA on; January 7, 2000, February 

w 

23, 2000 and June 8, 2000 respectively, to meet the requirements of Budapest Treaty for the 
^ international recognition of the deposit of microorganisms. 

20 The invention being thus described, it will be apparent to one of ordinary skill in the art 

that various modifications of the materials and methods for practicing the invention can be made. 
Such modifications are to be considered within the scope of the invention as defined by the 
following claims. 

Each of the references from the patent and periodical literature cited herein is hereby 
25 expressly incorporated in its entirety by such citation. 
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